What type of study is this?

This is a Experimental Study study.

November 3, 2025Open Access

Data augmentation and contrastive learning based on large language models

Key Points

The proposed method enhances spoken language understanding using synthetic data generated by ChatGPT, improving feature interaction.
Cross-entropy loss is optimized alongside contrastive loss, significantly boosting intent detection accuracy and slot filling metrics.
This approach constructs positive and negative intent-slot pairs, optimizing the model's performance using a contrastive learning mechanism.
Overall improvements were validated through ablation studies, highlighting the effectiveness of mixed data augmentation in dialogue systems.

Abstract

For the problem of insufficient feature interaction between intent classification and slot filling in spoken language understanding tasks, this paper proposes a method that uses ChatGPT to generate more diverse samples, combined with a contrastive learning approach, to improve the model architecture and strengthen the interaction between intent and slot features. Specifically, prompts are designed for ChatGPT to generate diverse synthetic data with the same slots but different intents, and with the same intents but different slots. A contrastive learning module is further designed, in which positive and negative intent-slot sample pairs are constructed via the ChatGPT-based mixed data augmentation method. The feature space distribution is optimized using a weighted InfoNCE loss, enhancing the aggregation of similar features and the separation of dissimilar ones. Meanwhile, a multi-task joint training framework is employed to simultaneously optimize the cross-entropy loss for intent classification and the contrastive loss, enabling deeper semantic interaction between intents and slots, thereby improving the overall model performance. Experimental results on the ATIS and SNIPS datasets demonstrate that the proposed method significantly outperforms traditional baseline models in both intent detection accuracy and slot filling F1 score. In addition, ablation studies confirm the effectiveness of the contrastive learning and mixed data augmentation components. Overall, this work introduces a contrastive learning mechanism to effectively address the insufficient label-feature interaction in spoken language understanding tasks, offering a novel approach for optimizing multi-task dialogue systems.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Yang et al. (Fri,) studied this question.

synapsesocial.com/papers/6907f1ac0328c9fb7920b5c9 https://doi.org/https://doi.org/10.54254/2977-3903/2025.25301

Bookmark

View Full Paper