Key points are not available for this paper at this time.
In this paper, we present a general-purpose solution to cartoon image synthesis with unpaired training data. In contrast to previous works learning pre-defined cartoon styles for specified usage scenarios (portrait or scene), we aim to train a common cartoon translator which can not only simultaneously render exaggerated anime faces and realistic cartoon scenes, but also provide flexible user controls for desired cartoon styles. It is challenging due to the complexity of the task and the absence of paired data. The core idea of the proposed method is to introduce gated cycle mapping, that utilizes a novel gated mapping unit to produce the category-specific style code and embeds this code into cycle networks to control the translation process. For the concept of category, we classify images into different categories (e.g., 4 types: photo/cartoon portrait/scene) and learn finer-grained category translations rather than overall mappings between two domains (e.g., photo and cartoon). Furthermore, the proposed method can be easily extended to cartoon video generation with an auxiliary dataset and a new adaptive style loss. Experimental results demonstrate the superiority of the proposed method over the state of the art and validate its effectiveness in the brand-new task of general cartoon image synthesis.
Building similarity graph...
Analyzing shared references across papers
Loading...
Yifang Men
King University
Yuan Yao
Nanjing University
Miaomiao Cui
Huazhong Agricultural University
Peking University
Alibaba Group (Cayman Islands)
Building similarity graph...
Analyzing shared references across papers
Loading...
Men et al. (Wed,) studied this question.
synapsesocial.com/papers/6a1d6c6a7f448865515e57e5 — DOI: https://doi.org/10.1109/cvpr52688.2022.00349
Synapse has enriched one closely related paper. Consider it for comparative context: