What question did this study set out to answer?

The study asks how to improve contextual sentiment recognition in the face of three obstacles: multimodal heterogeneity, knowledge scarcity, and inadequate capture of dynamic emotional transitions.

May 16, 2026 | Open Access

Research on contextual sentiment recognition based on neural encoding and decoding and knowledge guidance

Key Points

The research aims to enhance contextual sentiment recognition by addressing multimodal heterogeneity and knowledge limitations.
Proposed a dual-branch neural encoding–decoding architecture with separate branches for multimodal feature processing.
Incorporated explicit and implicit knowledge from language models for improved context understanding.
Utilized dynamic context windows to adapt to emotional changes during recognition.
Achieved accuracies of 82.1% on IEMOCAP, 78.3% on MELD, and 76.2% on DailyDialog datasets.
Demonstrated high inference speed of 950 samples/sec with the lightweight model.
Exhibited strong generalization across different datasets, enhancing practical utility.

Abstract

Contextual sentiment recognition is critical for applications such as intelligent customer service and mental health monitoring. However, existing models struggle with multimodal heterogeneity, knowledge scarcity, and inadequate capture of dynamic emotional transitions. To address these challenges, we propose a dual-branch neural encoding–decoding architecture integrated with dynamic knowledge guidance. The model processes multimodal features (text, speech, video) and contextual dependencies through separate branches, incorporating both explicit knowledge (personality traits, domain rules) and implicit knowledge distilled from large language models. A dynamic context window adapts to emotional shifts to enhance real-time perception. Experiments on the IEMOCAP, MELD, and DailyDialog datasets show that our full model achieves accuracies of 82.1%, 78.3%, and 76.2%, respectively, surpassing state-of-the-art baselines including fine-tuned GPT-4. The lightweight version (18.2M parameters) maintains high inference speed (950 samples/sec) while reducing deployment costs. Furthermore, the model exhibits strong cross-dataset generalization and practical utility. This work provides an efficient framework that addresses core challenges in contextual sentiment recognition, balancing performance with practicality for real-world deployment.
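The abstract does not say how the dynamic context window is computed, so the following is only a minimal sketch of one way such a window could behave, not the authors' method. It assumes each utterance already has an emotion vector, measures a shift as the cosine distance between consecutive vectors, and assumes the window shrinks after a shift and grows while the emotion is stable. The function names, the threshold, and the size limits are all hypothetical.

```python
from math import sqrt


def cosine_distance(a, b):
    """Cosine distance between two equal-length vectors (1.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def dynamic_window_sizes(emotion_vectors, min_size=2, max_size=8, shift_threshold=0.5):
    """Return, for each utterance, how many preceding utterances to use as context.

    Hypothetical policy (an assumption, not taken from the paper):
    - a large change between consecutive emotion vectors resets the window
      to min_size, so stale context is dropped after an emotional shift;
    - otherwise the window grows by one per turn, up to max_size.
    """
    sizes = []
    size = min_size
    for i, vec in enumerate(emotion_vectors):
        if i > 0:
            if cosine_distance(emotion_vectors[i - 1], vec) > shift_threshold:
                size = min_size
            else:
                size = min(size + 1, max_size)
        # The window cannot reach back past the start of the dialogue.
        sizes.append(min(size, i))
    return sizes


if __name__ == "__main__":
    # Three stable turns followed by an abrupt change of emotion.
    dialogue = [[1, 0], [1, 0], [1, 0], [0, 1], [0, 1]]
    print(dynamic_window_sizes(dialogue))
```

In this sketch the window grows over the first three turns and then resets to the minimum size at the fourth, where the emotion vector changes direction. A real system would presumably derive the shift signal from the model's own representations rather than from hand-written vectors.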
