What question did this study set out to answer?

The aim is to improve the recognition of infrequently seen legal elements in judgments despite imbalances in label frequency.

April 18, 2026Open Access

Adaptive Label Reweighting via Boundary-Aware Meta Learning for Long-Tail Legal Element Recognition

Key Points

The aim is to improve the recognition of infrequently seen legal elements in judgments despite imbalances in label frequency.
Developed a boundary-aware meta-learning framework named BML-Trans.
Implemented a label-wise meta-weighting mechanism using bilevel hypergradient descent.
Focused on high-uncertainty sentences for effective training signal.
Utilized a Multi-Scale Adapter to enhance warm-up probability estimates.
Achieved an average Avg-F1 of 82.5% on the CAIL2019 dataset.
Outperformed the strongest baseline by 1.2 percentage points overall.
Improved tail-label Macro-F1 by up to 5.7 percentage points.
Demonstrated only a 14% increase in training costs.

Abstract

Legal element recognition, which identifies discrete factual elements in Chinese court judgments to support judicial analysis and case retrieval, faces a severe long-tail challenge: head-to-tail label-frequency ratios exceed 100:1, and over 60% of sentences carry no label, starving rare elements of training signal. Static reweighting methods assign fixed weights prior to training and cannot respond to the model’s evolving confidence; sample-level meta-learning couples all co-occurring label gradients to a single scalar, preventing independent tail-label amplification. We propose BML-Trans, a boundary-aware meta-learning framework that addresses both limitations. A label-wise meta-weighting mechanism maintains per-label gradient weights updated via bilevel hypergradient descent, decoupling tail-label amplification from co-occurring head labels. A boundary-aware meta-set concentrates calibration signal on high-uncertainty, tail-triggering sentences rather than on easy negatives, and a lightweight Multi-Scale Adapter sharpens the warm-up probability estimates on which boundary selection depends. Concretely, BML-Trans achieves an average Avg-F1 of 82.5% on CAIL2019 across the labor, divorce, and loan domains, outperforming the strongest baseline by 1.2 percentage points overall and by up to 5.7 percentage points on tail-label Macro-F1, at only 14% additional training cost. Ablation confirms a cascade dependency among the three components, establishing that the gains are structural rather than incidental to threshold selection or initialization.

Read Full Paperexternally

اسأل الذكاء الاصطناعي

Bookmark

View Full Paper