Automatic anime sketch colorization aims to generate a color image from a sketch image. The task is challenging because limited structural and semantic understanding leads to constrained styles and semantically inconsistent colors. In this paper, we introduce a sketch-to-color diffusion model with semantic prompt learning (SPL), which learns better semantic prompts to elicit the powerful structural and semantic understanding capabilities of large-scale multi-modal diffusion models, effectively bridging the gap between sketch and color. We introduce two distillation strategies for learning semantic prompts: prediction-level distillation, which optimizes a global knowledge distillation loss and a local activation knowledge distillation loss, and feature-level distillation, which optimizes a hierarchy-wise feature distillation loss to transfer knowledge to the output features of different hierarchies in the model. Experimental results show that the proposed distillation strategies produce high-quality semantic prompts, yielding colorized images with better visual quality than those of current automatic anime sketch colorization methods.
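As a rough illustration of the feature-level strategy, the sketch below computes a hierarchy-wise feature distillation loss as a weighted sum of per-level mean squared errors between student and teacher features. The abstract does not give the form of the loss, so the use of MSE, the per-level weights, and the names `mse` and `hierarchy_feature_distillation_loss` are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Optional, Sequence


def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equal-length, non-empty feature vectors."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("feature vectors must be non-empty and equal in length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def hierarchy_feature_distillation_loss(
    student_feats: List[Sequence[float]],
    teacher_feats: List[Sequence[float]],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Weighted sum of per-level MSE terms (assumed form, hypothetical helper).

    Each list entry holds the flattened output features of one hierarchy level.
    """
    if len(student_feats) != len(teacher_feats):
        raise ValueError("student and teacher must have the same number of levels")
    if weights is None:
        # Assumption: every hierarchy level contributes equally.
        weights = [1.0] * len(student_feats)
    if len(weights) != len(student_feats):
        raise ValueError("one weight is required per hierarchy level")
    return sum(
        w * mse(s, t) for w, s, t in zip(weights, student_feats, teacher_feats)
    )


# Two hierarchy levels with toy feature values.
student = [[1.0, 2.0], [0.0, 0.0, 0.0]]
teacher = [[1.0, 4.0], [3.0, 0.0, 0.0]]
loss = hierarchy_feature_distillation_loss(student, teacher)
```

In a real diffusion model the per-level features would be tensors taken from different blocks of the network, and the same structure would carry over to a tensor library's MSE function.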
Ning Wang
Ningde Normal University
Yifei She
Rui Xu
Dalian University of Technology
The University of Sydney
Dalian University of Technology
Wang et al. studied this question.
synapsesocial.com/papers/68e73894b6db6435876b1f24 — DOI: https://doi.org/10.1109/icassp48485.2024.10448330