The growing demand for automation in legal contract management exposes a persistent limitation of current language models: insufficient adaptation to the semantic, structural, and regulatory constraints of legal language. While large language models perform well on general NLP tasks, their direct application to legal document classification, clause extraction, and contract summarization often yields unstable, legally unreliable outputs. This work presents a structured methodological pipeline for evaluating and adapting language models for legal contract automation, combining domain-specific fine-tuning of open-source models with a controlled comparative assessment against large general-purpose LLMs used exclusively in inference mode. The methodology integrates legal corpus curation, clause-level annotation, and efficient adaptation techniques, and is evaluated across three core tasks: contract document classification, normative clause extraction, and regulatory summarization. The evaluation protocol is explicitly designed to disentangle the effects of supervision from deployment constraints arising in regulated legal settings. Experimental results show consistent and statistically significant performance gains for legally adapted models over general-purpose baselines, achieving Macro-F1 of 0.921 in classification, span-level F1 of 0.903 in clause extraction, and ROUGE-L of 0.886 in summarization ( p 0.01). Robustness analysis and cross-validation confirm stability across heterogeneous private-sector contract types. The findings should be interpreted under the evaluated comparison regime and highlight that, in legally constrained multi-stage workflows, task-aligned supervision provides measurable structural benefits that are not reducible to model scale alone when general-purpose LLMs are restricted to inference-only deployment.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jaime Govea
Universidad de las Américas
Iván Ortiz-Garcés
Universidad de las Américas
Pablo Palacios
Diego Portales University
SHILAP Revista de lepidopterología
Frontiers in Artificial Intelligence
Diego Portales University
Pontificia Universidad Católica del Ecuador
Universidad de las Américas
Building similarity graph...
Analyzing shared references across papers
Loading...
Govea et al. (Thu,) studied this question.
synapsesocial.com/papers/69ca1210883daed6ee094ca2 — DOI: https://doi.org/10.3389/frai.2026.1782405