Key points are not available for this paper at this time.
Following the recent adoption by the machine translation community of automatic evaluation using the BLEU/NIST scoring process, we conduct an in-depth study of a similar idea for evaluating summaries. The results show that automatic evaluation using unigram co-occurrences between summary pairs correlates surprising well with human evaluations, based on various statistical metrics; while direct application of the BLEU evaluation procedure does not always give good results.
Building similarity graph...
Analyzing shared references across papers
Loading...
Chin-Yew Lin
Eduard Hovy
University of Southern California
Marina Del Rey Hospital
Building similarity graph...
Analyzing shared references across papers
Loading...
Lin et al. (Wed,) studied this question.
www.synapsesocial.com/papers/6a088929ad370a6b44de29b2 — DOI: https://doi.org/10.3115/1073445.1073465
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: