What question did this study set out to answer?

The research aims to develop a parameter-efficient framework for enhancing code representations, addressing limitations in current methods.

January 23, 2026Open Access

Enhancing Parameter-Efficient Code Representations with Retrieval and Structural Priors

Key Points

The research aims to develop a parameter-efficient framework for enhancing code representations, addressing limitations in current methods.
Introduced a structure–semantic dual-channel retrieval mechanism for external code knowledge
Developed a graph relative bias module to improve attention on structural relationships
Implemented a span-discriminative contrastive objective for clearer span-level representations
Conducted experiments across three benchmarks involving six programming languages
Achieved 22.1% improvement in Exact Match for code generation tasks
Obtained 4.4% increase in BLEU scores for code refinement
Outperformed state-of-the-art parameter-efficient baselines while using only about 5% of trainable parameters

Abstract

High-quality code representations are fundamental to code intelligence. Achieving such representations with parameter-efficient fine-tuning (PEFT) remains a key challenge. While code pre-trained models (CodePTMs) offer a robust foundation for general-purpose embeddings, current PEFT approaches face two main obstacles when adapting them: (i) they fail to adequately capture the deep structural characteristics of programs, and (ii) they are limited by the model’s finite internal parameters, restricting their ability to overcome inherent knowledge bottlenecks. To address these challenges, we introduce a parameter-efficient code representation learning framework that combines retrieval augmentation with structure-aware priors. Our framework features three complementary, lightweight modules: first, a structure–semantic dual-channel retrieval mechanism that infuses high-quality external code knowledge as non-parametric memory to alleviate the knowledge bottleneck; second, a graph relative bias module that strengthens the attention mechanism’s capacity to model structural relationships within programs; and third, a span-discriminative contrastive objective that sharpens the distinctiveness and boundary clarity of span-level representations. Extensive experiments on three benchmarks spanning six programming languages show that our method consistently outperforms state-of-the-art parameter-efficient baselines. Notably, on structure-sensitive tasks using the PLBART backbone, RS-Rep surpasses full fine-tuning, delivering a 22.1% improvement in Exact Match for code generation and a 4.4% increase in BLEU scores for code refinement, all while utilizing only about 5% of the trainable parameters.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Zheng et al. (Wed,) studied this question.

synapsesocial.com/papers/69730ed4c8125b09b0d1ea70 https://doi.org/https://doi.org/10.3390/app16021106

Bookmark

View Full Paper