Oracle Bone Inscriptions (OBI), the earliest systematic writing in China, are crucial for understanding early Chinese civilization. However, many inscriptions remain undeciphered due to limited data, complex glyphs, and weak semantic consistency. Although Generative Adversarial Networks (GAN) and Diffusion Models have introduced new possibilities to the field, most existing methods primarily focus on visual features and lack semantic integration. To address this challenge, we propose DCSD-OBI, a Dual-Conditional Stable Diffusion model that integrates OBI images and modern Chinese text during reverse diffusion to jointly learn structural and semantic features, thereby improving character generation accuracy. To reduce training cost while preserving generative quality, we adopt Low-Rank Adaptation (LoRA) to fine-tune only the U-Net's cross-attention modules. Furthermore, we replace CLIP with Chinese-CLIP, a version tailored for the Chinese language, to improve cross-modal semantic consistency. Experiments results show DCSD-OBI improves decipherment accuracy by 11%, highlighting its effectiveness and potential for advancing OBI research.
Ran et al. (Fri,) studied this question.