April 23, 2024Open Access

Does Instruction Tuning Make LLMs More Consistent?

Key Points

Key points are not available for this paper at this time.

Abstract

The purpose of instruction tuning is enabling zero-shot performance, but instruction tuning has also been shown to improve chain-of-thought reasoning and value alignment (Si et al. , 2023). Here we consider the impact on consistency, i. e. , the sensitivity of language models to small perturbations in the input. We compare 10 instruction-tuned LLaMA models to the original LLaMA-7b model and show that almost across-the-board they become more consistent, both in terms of their representations and their predictions in zero-shot and downstream tasks. We explain these improvements through mechanistic analyses of factual recall.

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper

Cite This Study

Fierro et al. (Tue,) studied this question.

synapsesocial.com/papers/68e6e09eb6db64358765c52b https://doi.org/https://doi.org/10.48550/arxiv.2404.15206

AI에게 질문

Bookmark

View Full Paper