Deep generative models trained on sensitive data pose significant privacy risks, yet enforcing differential privacy (DP) in high-dimensional generators often leads to severe utility degradation. We propose Differentially Private Vector-Quantized Generation (DP-VQG), a three-stage generative framework that introduces a discrete latent bottleneck as the interface for privacy preservation. DP-VQG separates geometric structure learning, differentially private discrete latent projection, and non-private prior modeling, ensuring that privacy-induced randomness operates on a finite codebook aligned with the decoder’s effective support. This design avoids off-support degradation while providing formal end-to-end DP guarantees through composition and post-processing. We provide a theoretical analysis of privacy and utility, including explicit bounds on privacy-induced distortion. Empirically, under the privacy budget of ε=10, DP-VQG attains Fréchet Inception Distance (FID) scores of 18.21 on MNIST and 77.09 on Fashion-MNIST, surpassing state-of-the-art differentially private generative models of comparable scale. Moreover, DP-VQG produces visually coherent samples on high-resolution datasets such as Flowers102, Food101, CelebA-HQ, and Cars, demonstrating scalability beyond prior end-to-end DP generative approaches.
Ge et al. (Thu,) studied this question.