May 7, 2024Open Access

Fine-tuning Large Language Models: A Brief Introduction

Key Points

Key points are not available for this paper at this time.

Abstract

Supervised Fine-tuning, Reinforcement Learning from Human Feedback and the latest SteerLM Author · Xuzeng He ( ORCID: 0009–0005–7317–7426) Introduction Large Language Models (LLMs), usually trained with extensive text data, can demonstrate remarkable capabilities in handling various tasks with state-of-the-art performance. However, people nowadays typically want something more personalised instead of a general solution.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Xuzeng He (Tue,) studied this question.

www.synapsesocial.com/papers/68e6b3b1b6db6435876352d5 — DOI: https://doi.org/10.59350/1aezq-kk827

Fine-tuning Large Language Models: A Brief Introduction

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion