Designing complex 3D scenes has been a tedious, manual process requiring domain expertise. Emerging text-to-3D generative models show great promise for making this task more intuitive, but existing approaches are limited to object-level generation. We introduce locally conditioned diffusion as an approach to compositional scene diffusion, providing control over semantic parts using text prompts and bounding boxes while ensuring seamless transitions between these parts. We demonstrate a score distillation sampling-based text-to-3D synthesis pipeline that enables compositional 3D scene generation at a higher fidelity than relevant baselines.
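The abstract does not spell out how text prompts and bounding boxes steer the diffusion process. One plausible reading, offered here only as a minimal sketch and not as the authors' implementation, is that each box becomes a spatial mask and the per-prompt noise predictions are blended by those masks at every denoising step. In the NumPy example below, `box_mask`, `compose_noise`, and `toy_predict_noise` are hypothetical names, and the toy predictor stands in for a real text-conditioned diffusion model.

```python
import numpy as np

def box_mask(height, width, box):
    """Binary mask for a (top, left, bottom, right) bounding box."""
    top, left, bottom, right = box
    mask = np.zeros((height, width), dtype=np.float32)
    mask[top:bottom, left:right] = 1.0
    return mask

def compose_noise(x_t, prompts, masks, predict_noise):
    """Blend per-prompt noise predictions using spatial masks.

    Assumption: the masks partition the image, so every pixel takes its
    noise estimate from exactly one prompt.
    """
    eps = np.zeros_like(x_t)
    for prompt, mask in zip(prompts, masks):
        # mask[None] broadcasts the (H, W) mask over the channel axis.
        eps += mask[None] * predict_noise(x_t, prompt)
    return eps

def toy_predict_noise(x_t, prompt):
    """Stand-in for a text-conditioned denoiser (hypothetical).

    Returns a constant field that depends only on the prompt length,
    so the composition is easy to inspect.
    """
    return np.full_like(x_t, float(len(prompt)))

# Two prompts, each assigned to one half of an 8x8, 3-channel image.
x_t = np.zeros((3, 8, 8), dtype=np.float32)
prompts = ["a chair", "a wooden table"]
masks = [box_mask(8, 8, (0, 0, 8, 4)), box_mask(8, 8, (0, 4, 8, 8))]
eps = compose_noise(x_t, prompts, masks, toy_predict_noise)
```

In this sketch every prediction is computed on the full noisy image and only then masked, which is one way the regions could stay consistent with each other across box boundaries. A real pipeline would replace `toy_predict_noise` with a pretrained model and feed the composed estimate into its sampler or into a score distillation loss.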
Po et al. studied this question.
www.synapsesocial.com/papers/68e7375cb6db6435876b0a9b — DOI: https://doi.org/10.1109/3dv62453.2024.00026
Ryan Po
Gordon Wetzstein
Stanford University