What question did this study set out to answer?

This work aims to develop a lightweight dialogue generation system for emotionally supportive interactions while ensuring computational efficiency and safety.

May 19, 2026Open Access

Efficient and responsible transformer based conversational agents for emotionally supportive dialogue

Key Points

This work aims to develop a lightweight dialogue generation system for emotionally supportive interactions while ensuring computational efficiency and safety.
Proposed model is based on T5-small architecture and fine-tuned on MentalChat16K corpus.
No reinforcement learning or emotion-specific training objectives were utilized.
Empirical evaluations compared the model's performance against GPT-2 baselines.
Achieved BLEU score of 32.14, ROUGE-L of 44.72, and BERTScore-F1 of 85.11.
Expert evaluations indicated high coherence and emotional appropriateness with substantial inter-rater agreement.
No factual inaccuracies or unsafe responses were identified during manual review.

Abstract

Abstract Conversational agents designed for emotionally supportive interactions face challenges in balancing affective responsiveness, computational efficiency, and safety in communication. Prior approaches frequently depend on large-scale models, handcrafted affective objectives, or reinforcement learning from human feedback, which can limit scalability and interpretability. This work presents a lightweight, domain-adapted dialogue generation system based on the T5-small architecture, fine-tuned on MentalChat16K, a curated corpus of real and synthetic emotional-support conversations. The proposed model operates without reinforcement learning or emotion-specific training objectives, yet demonstrates encouraging alignment with affective cues and fluent response generation within the evaluated dataset. Empirical evaluation shows improvements over zero-shot and fine-tuned GPT-2 baselines, achieving BLEU (32.14), ROUGE-L (44.72), and BERTScore-F1 (85.11). Expert human assessments indicated high ratings in coherence, emotional appropriateness, and contextual relevance, with substantial inter-rater agreement. Qualitative error analysis indicated generally conservative and context-aware responses within the evaluated sample. During manual review of this sample, no factual hallucinations, medical overreach, or overtly unsafe responses were observed; however, systematic safety benchmarking was beyond the scope of the present study. This study provides initial evidence that compact transformer-based models, when adapted to domain-specific corpora and evaluated under controlled conditions, can support efficient and affectively appropriate dialogue generation in emotionally supportive non-clinical settings, while requiring further safety validation before broader real-world deployment.

Bookmark

View Full Paper

Cite This Study

Saleela et al. (Sun,) studied this question.

synapsesocial.com/papers/6a0bfe2d166b51b53d3796a9 https://doi.org/https://doi.org/10.1007/s44163-026-01426-6

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Bookmark

View Full Paper