March 19, 2024 · Open Access

PanDa: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation

Abstract

Prompt Transfer (PoT) is a recently proposed approach to improving prompt-tuning: the target prompt is initialized with an existing prompt trained on similar source tasks. However, this vanilla PoT approach usually achieves sub-optimal performance, because (i) PoT is sensitive to the similarity of the source-target pair, and (ii) directly fine-tuning a prompt initialized with the source prompt on the target task can lead to forgetting of the useful general knowledge learned from the source task. To tackle these issues, we propose a new metric that accurately predicts prompt transferability (regarding (i)), and a novel PoT approach (namely PanDa) that leverages knowledge distillation to effectively alleviate knowledge forgetting (regarding (ii)). Extensive and systematic experiments on 189 combinations of 21 source and 9 target datasets across 5 scales of PLMs demonstrate that: 1) our proposed metric works well for predicting prompt transferability; 2) PanDa consistently outperforms the vanilla PoT approach by 2.3% in average score (up to 24.1%) across all tasks and model sizes; 3) with PanDa, prompt-tuning can achieve performance competitive with, and even better than, model-tuning across various PLM scales.
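The two ingredients named in the abstract, initializing a target prompt from a source prompt and adding a knowledge distillation term to the training objective, can be illustrated with a minimal sketch. The abstract does not specify PanDa's actual loss, so the code below uses generic logit-level distillation (a temperature-softened KL term added to the task loss) purely as a stand-in. The function names and the `alpha` and `temperature` parameters are hypothetical and not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Task loss: negative log-probability of the gold label."""
    return -math.log(softmax(logits)[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with q > 0 everywhere."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def init_target_prompt(source_prompt):
    """Vanilla PoT initialization: copy the source prompt vectors.

    `source_prompt` is a list of embedding vectors (lists of floats).
    """
    return [vector[:] for vector in source_prompt]


def distilled_loss(student_logits, teacher_logits, label,
                   alpha=0.5, temperature=2.0):
    """Task loss plus a distillation term (assumed form, not PanDa's).

    The teacher stands for the model with the frozen source prompt,
    the student for the model with the trainable target prompt.
    """
    task = cross_entropy(student_logits, label)
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    distill = kl_divergence(teacher_probs, student_probs) * temperature ** 2
    return task + alpha * distill
```

In this sketch the distillation term vanishes when the student reproduces the teacher's output distribution and grows as the two diverge, which is one common way to discourage a fine-tuned prompt from drifting away from what the source prompt encoded. Setting `alpha` to 0 recovers plain fine-tuning of the copied prompt, that is, vanilla PoT.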

Cite This Study

Zhong et al. (2024) studied this question.

synapsesocial.com/papers/68e734edb6db6435876ae466 https://doi.org/10.1109/tkde.2024.3376453
