May 23, 2024Open Access

Semantica: An Adaptable Image-Conditioned Diffusion Model

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

We investigate the task of adapting image generative models to different datasets without finetuneing. To this end, we introduce Semantica, an image-conditioned diffusion model capable of generating images based on the semantics of a conditioning image. Semantica is trained exclusively on web-scale image pairs, that is it receives a random image from a webpage as conditional input and models another random image from the same webpage. Our experiments highlight the expressivity of pretrained image encoders and necessity of semantic-based data filtering in achieving high-quality image generation. Once trained, it can adaptively generate new images from a dataset by simply using images from that dataset as input. We study the transfer properties of Semantica on ImageNet, LSUN Churches, LSUN Bedroom and SUN397.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Kumar et al. (Thu,) studied this question.

synapsesocial.com/papers/68e68cfdb6db643587614de4 — DOI: https://doi.org/10.48550/arxiv.2405.14857

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Authors

M. Kumar

Chandigarh University

Neil Houlsby

Google (United States)

Emiel Hoogeboom

Google (United States)

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Semantica: An Adaptable Image-Conditioned Diffusion Model

Puntos clave

Resumen

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider