March 3, 2026Open Access

Förstärkningsinlärning från mänsklig återkoppling för informationsutvinning av juridisk tekst

Key Points

RLHF enhances model performance on formal information extraction tasks, leading to more precise results.
Moderate inter-annotator agreement indicates the task's complexity, even among domain experts.
Evaluation utilized a dataset of 282 expert-annotated legal text segments linked to specific acts.
Experiments show that human feedback yields better coherence and accuracy compared to AI feedback.

Abstract

Extracting structured information from complex legal texts poses a significant challenge due to the ambiguity and precision required in the legal domain. While Large Language Models (LLMs) have shown promise for such tasks, the effectiveness of Reinforcement Learning from Human Feedback (RLHF) in improving their performance on formal information extraction, such as identifying ontology-based facts, acts, and preconditions, has not been thoroughly explored. In this study, we examine methods to enhance the performance of smallerscale LLMs on legal information extraction by using 1) RLHF and 2) Reinforcement Learning from AI Feedback (RLAIF). We construct a dataset of 282 expert-annotated legal text segments, each labelled with preconditions or subfacts linked to specific acts or facts based on a legal-domain ontology. For instance, if borrowing a book requires library membership, the model should identify this precondition and the corresponding section in the legal document. We collect feedback on model-generated answers from six domain experts and a GPT-4.1-based LLM-as-a-judge, using evaluation criteria consistent across both sources. The human feedback shows moderate inter-annotator agreement (Fleiss’ Kappa ≈ 0.5), indicating the inherent difficulty of the task, even among experts. Our findings demonstrate that RLHF leads to better model performance than RLAIF. Human-in-the-loop training yields more accurate and coherent extractions, while models fine-tuned on AI feedback tend to produce shorter responses that are often correct but less comprehensive. Notably, human feedback promotes gradual, structured improvements in output quality, reinforcing the value of expert evaluation for complex NLP tasks. While promising, our method requires scaling to larger datasets to validate its effectiveness fully. Nevertheless, our study provides early evidence that RLHF, particularly with expert input, is a powerful tool for aligning LLMs with high-precision information extraction goals in complex domains such as law.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Jacques Fürst (Wed,) studied this question.

synapsesocial.com/papers/69a76226c6e9836116a30491

Bookmark

View Full Paper