What does this research mean for the field?

High-quality, domain-specific large language models for legal assistance can be efficiently constructed using limited computational resources via QLoRA and 4-bit quantization. Novelty: methodological. Consensus alignment: neutral.

What question did this study set out to answer?

This research aims to enhance efficiency in legal case review by introducing a lightweight large language model.

June 3, 2026 · Open Access

ILL: A Lightweight Large Language Model for Legal and Courtroom Assistance

Key Points

The study aims to make legal case review more efficient by introducing ILL, a lightweight large language model.
The ILL model was trained using QLoRA with 4-bit quantization on an RTX 4060 GPU.
Model performance was evaluated using BERTScore F1, perplexity, and MMLU task metrics.
Experiments on small-scale datasets assessed the model's effectiveness, with approximate convergence at about 1200 sentences.
ILL achieved a BERTScore F1 of 0.8037, indicating high semantic similarity between generated and reference text.
The model reached a perplexity of 1.89, meaning it predicts the evaluation text with low uncertainty; perplexity measures predictive fit rather than task accuracy.
Fine-tuning largely preserved TruthfulQA performance and showed competitive results on MMLU tasks.
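
To make the perplexity figure concrete, here is a minimal sketch of how perplexity is conventionally computed from per-token log-probabilities: the exponential of the mean negative log-likelihood. The function name `perplexity` and the sample probabilities are hypothetical illustrations, not the authors' evaluation code or data.

```python
import math

def perplexity(token_log_probs):
    """Perplexity = exp(mean negative log-likelihood per token), natural log."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)

# Hypothetical per-token probabilities for a short held-out sentence
# (illustrative values only, not taken from the paper).
probs = [0.9, 0.5, 0.4, 0.6]
log_probs = [math.log(p) for p in probs]
ppl = perplexity(log_probs)

# Inverting the formula for the reported value: a perplexity of 1.89
# corresponds to a mean negative log-likelihood of ln(1.89) nats per token,
# i.e. a geometric-mean token probability of 1 / 1.89.
mean_nll_reported = math.log(1.89)
geo_mean_prob_reported = 1 / 1.89

print(f"toy perplexity: {ppl:.4f}")
print(f"mean NLL at perplexity 1.89: {mean_nll_reported:.4f} nats/token")
print(f"geometric-mean token probability: {geo_mean_prob_reported:.4f}")
```

Read this way, a perplexity of 1.89 means the model assigns, on average (geometrically), a little over one-half probability to each correct next token. That describes how well the model fits the evaluation text, which is a different quantity from accuracy on a downstream legal task.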

Abstract

Manual case review in legal and courtroom workflows faces efficiency bottlenecks. While LLMs offer potential for vertical domains, they often struggle with domain-specific accuracy and hallucinations. This paper introduces ILL, a lightweight model for legal and courtroom assistance trained via QLoRA. By employing 4-bit quantization on an RTX 4060 GPU, ILL achieves precise knowledge transfer with low computational costs. The model attained a BERTScore F1 of 0.8037 and a perplexity of 1.89, while largely preserving TruthfulQA performance after fine-tuning and demonstrating competitive results on MMLU tasks. Experiments demonstrate that this method performs excellently on small-scale datasets and shows approximate convergence at scales of about 1200 sentences. This work validates the feasibility and efficiency of constructing high-quality vertical auxiliary models using limited computational resources.
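
The following back-of-the-envelope sketch suggests why 4-bit quantization combined with low-rank adapters can fit on a consumer GPU such as the RTX 4060 (8 GB of VRAM in its standard configuration). The summary does not state ILL's base model size, LoRA rank, or which layers were adapted, so every constant below (`BASE_PARAMS`, `HIDDEN`, `RANK`, `ADAPTED_MATRICES`) is an assumption chosen purely for illustration. The estimate covers weight storage only and ignores quantization constants, activations, and optimizer state.

```python
def weight_memory_gib(num_params, bits_per_param):
    """Memory needed to store the weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 2**30

def lora_trainable_params(d_in, d_out, rank):
    """A LoRA adapter on a (d_out x d_in) weight adds two low-rank factors:
    one of shape (rank x d_in) and one of shape (d_out x rank)."""
    return rank * (d_in + d_out)

# ASSUMPTIONS (not from the source): a 7B-parameter base model with hidden
# size 4096, rank-16 adapters on two square attention projections in each
# of 32 layers.
BASE_PARAMS = 7_000_000_000
HIDDEN = 4096
RANK = 16
ADAPTED_MATRICES = 2 * 32

fp16 = weight_memory_gib(BASE_PARAMS, 16)
nf4 = weight_memory_gib(BASE_PARAMS, 4)
adapter = ADAPTED_MATRICES * lora_trainable_params(HIDDEN, HIDDEN, RANK)

print(f"16-bit weights: {fp16:.1f} GiB")
print(f"4-bit weights:  {nf4:.1f} GiB")
print(f"trainable LoRA parameters: {adapter:,} "
      f"({adapter / BASE_PARAMS:.3%} of the base model)")
```

Under these assumed numbers, the frozen base weights shrink by a factor of four when stored in 4 bits, and the trainable adapter is a small fraction of a percent of the base parameter count. This is the general mechanism QLoRA relies on; the actual savings for ILL depend on configuration details the summary does not give.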
