Los puntos clave no están disponibles para este artículo en este momento.
Many deep generative models, such as variational autoencoders (VAEs) and generative adversarial networks (GANs), learn an immersion mapping from a standard normal distribution in a low-dimensional latent space into a higher-dimensional data space. As such, these mappings are only capable of producing simple data topologies, i.e., those equivalent to an immersion of Euclidean space. In this work, we demonstrate the limitations of such latent space generative models when trained on data distributions with non-trivial topologies. We do this by training these models on synthetic image datasets with known topologies (spheres, torii, etc.). We then show how this results in failures of both data generation as well as data interpolation. Next, we compare this behavior to two classes of deep generative models that in principle allow for more complex data topologies. First, we look at chart autoencoders (CAEs), which construct a smooth data manifold from multiple latent space chart mappings. Second, we explore score-based models, e.g., denoising diffusion probabilistic models, which estimate gradients of the data distribution without resorting to an explicit mapping to a latent space. Our results show that these models do demonstrate improved ability over latent space models in modeling data distributions with complex topologies, however, challenges still remain.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yinzhu Jin
Brigham and Women's Hospital
R. McDaniel
Oldham Council
N. Joseph Tatro
Rensselaer Polytechnic Institute
Frontiers in Computer Science
Duke University
University of Virginia
University of Wisconsin–Stout
Building similarity graph...
Analyzing shared references across papers
Loading...
Jin et al. (Mon,) studied this question.
synapsesocial.com/papers/68e5add7b6db643587547e2f — DOI: https://doi.org/10.3389/fcomp.2024.1260604