What question did this study set out to answer?

The research aims to analyze the impact of evaluation metrics on text summarization performance for Indian legal documents.

March 27, 2026Open Access

Comparative Analysis of Text Summarization Models for Indian Legal Documents

Key Points

The research aims to analyze the impact of evaluation metrics on text summarization performance for Indian legal documents.
Conducted a comparative evaluation of extractive and abstractive summarization models.
Used standard evaluation metrics like ROUGE, BLEU, and BERTScore.
Maintained uniform experimental settings for consistency.
Different evaluation metrics highlighted various aspects of summary quality.
Conclusions about model effectiveness varied significantly with the chosen metric.
Demonstrated the need for careful interpretation of metrics in legal text summarization.

Abstract

Automatic text summarization is frequently evaluated using standard automatic metrics such as ROUGE, BLEU, and BERTScore. These metrics are widely adopted due to their ease of computation and reproducibility. However, their interpretation becomes challenging in specialized domains such as legal text, where document length, formal language, and information density differ significantly from general-purpose datasets. This paper examines how commonly used evaluation metrics influencethe interpretation of summarization performance for Indian legal documents. Using results obtained from a comparativeevaluation of extractive and abstractive summarization models under uniform experimental settings, we analyze how differentmetrics emphasize different aspects of summary quality. The study highlights that conclusions regarding model effectivenessmay vary depending on the chosen evaluation metric, underscoring the importance of careful metric interpretation in legal textsummarization research.

Read Full Paperexternally

KI fragen

Bookmark

View Full Paper