What question did this study set out to answer?

This research aims to enhance Yi character detection in historical documents using advanced representation learning techniques.

March 28, 2026Open Access

Fine grained representation learning for low resource Yi script detection and dataset construction

Key Points

This research aims to enhance Yi character detection in historical documents using advanced representation learning techniques.
Developed a fine-grained representation learning framework (FGRL-YiNet)
Integrated dynamic convolution and adaptive multi-scale fusion modules
Created the YiPrint-694 dataset for training data in low-resource scenarios
Conducted extensive experiments on Yi benchmarks and the public MTHv2 dataset
FGRL-YiNet significantly outperformed existing models on Yi benchmarks, particularly for weak strokes
Demonstrated strong generalizability on the MTHv2 dataset
Established a benchmark for underserved scripts, contributing to digital heritage preservation

Abstract

Abstract Yi character detection in historical documents is challenged by complex morphology, dense strokes, and multi-scale layouts. To address these issues, we propose a novel fine-grained representation learning framework for Yi character detection (FGRL-YiNet) that integrates dynamic convolution and adaptive multi-scale fusion modules. This design enables the model to adaptively refine receptive fields to capture elusive stroke topology while suppressing background interference, directly addressing the fundamental limitations of static feature extraction in existing methods. Integrated with multi-scale feature fusion and a differentiable binarization head, our end-to-end system achieves robust character localization under severe degradation. Furthermore, we develop the YiPrint-694 dataset to support training in this low-resource domain. Extensive experiments show that FGRL-YiNet significantly outperforms state-of-the-art models on Yi benchmarks, particularly for weak strokes, and demonstrates strong generalizability on the public MTHv2 dataset. This work establishes a benchmark and architectural paradigm for underserved scripts, enabling practical solutions for digital heritage preservation.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Haipeng Sun

Xueyan Ding

Hua Yu

Actions

Institutions

University of California, Berkeley

Minzu University of China

Dalian Minzu University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Fine grained representation learning for low resource Yi script detection and dataset construction

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study