March 3, 2026. Open Access

Enhancing Legal Text Entailment: Evaluating Model Architectures, Training Approaches, and Interpretability

Key Points

Domain-specific further pretraining can improve legal text classification performance for smaller models.
Larger language models such as GPT-4o show promise in outperforming prompt engineering when fine-tuned with LoRA.
Model architectures were compared, focusing on their ability to generate interpretable decisions in legal contexts.
The integration of adaptation and explainability in one system may enhance trustworthiness and understanding.

Abstract

This study investigates the effectiveness of various model architectures and training strategies for legal text classification, focusing on entailment classification for case decisions from the Federal Court of Canada. We compare RoBERTa models, with and without domain-specific further pretraining, against larger language models such as Llama 2, Llama 3, and GPT-4o that are adapted to the task through prompt engineering and LoRA fine-tuning. We also investigate methods for explaining the models' decisions and evaluate these methods for adequacy, understandability, trustworthiness, and sufficiency. Our findings suggest that, for legal entailment classification, domain-specific pretraining can improve the performance of smaller models, while larger language models show promise in outperforming prompt engineering for classification when fine-tuned with LoRA, and in generating more interpretable explanations. To the best of our knowledge, this is the first study in the context of Canadian legal AI to explore the effects of further pretraining on small and large language models, and to integrate language model adaptation and explainability into one system for legal text entailment classification.
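For readers unfamiliar with LoRA, the following is a minimal sketch of the general low-rank update it relies on: a frozen weight matrix W is adjusted by a scaled product of two small trainable matrices, W + (alpha / r) * B * A. The abstract does not state the rank, scaling factor, or layers used in the study, so every function name, shape, and value below is an illustrative assumption and not the authors' configuration.

```python
# Illustrative only: plain-Python sketch of the LoRA low-rank update.
# All names, shapes, and values are assumptions, not the study's settings.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * (B @ A).

    w: frozen pretrained weight, shape d_out x d_in
    a: trainable matrix, shape r x d_in
    b: trainable matrix, shape d_out x r (typically initialised to zeros)
    """
    r = len(a)                 # rank of the update
    scale = alpha / r
    delta = matmul(b, a)       # d_out x d_in, rank at most r
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]

def parameter_counts(d_out, d_in, r):
    """Return (parameters in full W, trainable parameters in A and B)."""
    return d_out * d_in, r * (d_out + d_in)

w = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]   # frozen 2 x 3 weight
a = [[1.0, 0.0, 1.0]]                    # rank r = 1
b_init = [[0.0], [0.0]]                  # zero init: no change to W at start
b_trained = [[0.5], [-1.0]]              # hypothetical values after training

unchanged = lora_effective_weight(w, a, b_init, alpha=2.0)
adapted = lora_effective_weight(w, a, b_trained, alpha=2.0)
full, trainable = parameter_counts(4096, 4096, 8)
```

With a zero-initialised B the adapted layer starts out identical to the pretrained one, and for a hypothetical 4096 x 4096 layer at rank 8 only a small fraction of the parameters are trained, which is what makes this style of fine-tuning practical for larger models.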

Authors

Michel Custeau

Diana Inkpen

Institutions

University of Ottawa
