In recent years, cyberspace governance has become a critical component of national security strategies worldwide. Although social network platforms provide users with convenient channels for expression and information acquisition, unregulated, harmful euphemisms have become increasingly prevalent. These euphemisms disrupt the order of the digital space and trigger secondary harms such as cyberbullying and regional discrimination. Currently, researches on Chinese harmful euphemism detection face three key challenges: the lack of large-scale annotated datasets, the cognitive reasoning deficit in lightweight models, and the latency constraints of Large Language Models (LLMs), which collectively constrain detection performance and real-world generalization. To address these issues, this study first collected a large corpus from social networking platforms and constructed a fine-grained annotated harmful euphemism dataset. Then, a representation learning framework was designed by integrating deep prompt-based chain-of-thought reasoning with multi-head contrastive learning. This framework introduces external knowledge from LLMs to enhance the diversity and precision of semantic representations. Finally, a multi-dimensional semantic perception fusion framework was proposed. It incorporates multiple semantic perception channels and a cross-channel dynamic fusion mechanism, enabling the model to better capture implicit semantics and integrate external contextual knowledge. Experimental results show that our approach significantly outperforms state-of-the-art lightweight models. While large-scale LLMs exhibit superior zero-shot transferability in cross-domain tasks, our proposed model maintains highly competitive performance with substantially lower inference latency and computational overhead. This research provides a novel methodological and technical foundation for detecting harmful euphemisms in social networks.
Xie et al. (Sun,) studied this question.