The core of font generation technology lies in preserving character structure and accurately transferring stylistic features. While extensive research has been conducted on Chinese character generation, studies on Yi script font generation remain in their infancy. This paper, therefore, propose a framework that employs a multi-scale conditional diffusion model to generate Yi script fonts. A Multi-scale Content Perception (MCP) module is designed. This module employs a constructed channel-space dual-domain multi-scale attention coordination mechanism to hierarchically capture multi-scale spatial and content information of characters within images. To achieve cross-modal feature fusion for Yi characters, an Efficient Style Insertion (ESI) module was developed. This module employs a cross-attention mechanism integrating multi-head mechanisms and efficient channel attention to enable cross-modal interaction of font style features. It further introduces block-wise attention to overcome computational redundancy issues inherent in traditional cross-attention. Finally, comparative experiments across Yi font datasets—including the public standardized Yi font file (PSFF-Yi) dataset, Self-Built Handwritten Yi (SBHW-Yi) dataset, and Ancient Manuscript Handwritten Yi (AMHW-Yi) dataset—demonstrate the feasibility of this approach.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zedong Li
Mengdi Li
CunRui Wang
International Journal of Pattern Recognition and Artificial Intelligence
Building similarity graph...
Analyzing shared references across papers
Loading...
Li et al. (Wed,) studied this question.
www.synapsesocial.com/papers/698435f0f1d9ada3c1fb5621 — DOI: https://doi.org/10.1142/s0218001426590123