What question did this study set out to answer?

To enhance zero-shot performance of large language models through a novel fine-tuning technique targeting two specific tokens.

April 23, 2026Open Access

2T-FT: Two-Token Fine-tuning improves zero-shot performance with minimal training

Key Points

To enhance zero-shot performance of large language models through a novel fine-tuning technique targeting two specific tokens.
Proposed a two-token fine-tuning method leveraging implicitly generated Chain of Thought paths.
Evaluated performance on arithmetic tasks using specific datasets: MultiArith, GSM8K, and SVAMP.
Compared efficiency against the LoRA fine-tuning method by measuring training time and performance output.
Achieved a 22.7% improvement on MultiArith, 9.0% on GSM8K, and 2.3% on SVAMP.
Reduced processing time by over 90% compared to traditional LoRA fine-tuning methods.

Abstract

Chain of Thought (CoT) prompting has been shown to improve the performance of large language models (LLMs) in a wide range of tasks, including arithmetic, common-sense, and symbolic reasoning. However, this improvement requires the development of effective CoT prompts. On the other hand, more recent work has shown that CoT reasoning paths are often inherently present in top-k alternative decoding sequences, even in the absence of any specific prompting technique. In this study, we propose a new fine-tuning method that exploits this property by targeting only two specific tokens of these pre-existing CoT responses. We demonstrate that fine-tuning only two tokens using the model’s own implicitly generated CoT paths leads to a significant efficiency gain, reducing training time while still achieving meaningful performance improvements. When evaluated on arithmetic datasets, we achieved a 22.7% improvement on MultiArith, 9.0% on GSM8K, and 2.3% on SVAMP when validated on the Phi-2 model from a greedy decoding perspective, reducing the processing time by over 90% compared to the LoRA fine-tuning method. Code is publicly available at: https://github.com/paulosantosneto/2tft .

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Paulo S. Neto

Universidade Federal do Rio Grande

Jardel D. S. Dyonisio

Universidade Federal do Rio Grande

João F. S. S. Lemos

Universidade Federal do Rio Grande

Journals

Neural Computing and Applications

Actions

Institutions

Universidade Federal do Rio Grande

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

2T-FT: Two-Token Fine-tuning improves zero-shot performance with minimal training

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study