What does this research mean for the field?

Sentence-level BLEU correlates poorly with human judgment because it takes a geometric mean of n-gram precisions, so smoothing techniques are needed to make sentence-level evaluation more accurate. Novelty: incremental. Consensus alignment: neutral.

A Systematic Comparison of Smoothing Techniques for Sentence-Level BLEU

January 1, 2014 · Open Access

Abstract

BLEU is the de facto standard machine translation (MT) evaluation metric. However, because BLEU computes a geometric mean of n-gram precisions, it often correlates poorly with human judgment at the sentence level.
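
To make the geometric-mean problem concrete, here is a minimal Python sketch of single-reference sentence-level BLEU. It is an illustration under stated assumptions, not the paper's implementation: the function names `ngram_counts` and `sentence_bleu` are hypothetical, and the `smooth` option uses simple add-one smoothing on orders above unigram, which is chosen here only as a familiar example and is not necessarily one of the techniques the paper compares.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams of order n in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(hypothesis, reference, max_n=4, smooth=False):
    """Single-reference sentence-level BLEU (hypothetical helper).

    With smooth=True, add-one smoothing is applied to orders n > 1.
    This is an illustrative assumption, not the paper's method.
    """
    if not hypothesis:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        hyp = ngram_counts(hypothesis, n)
        ref = ngram_counts(reference, n)
        # Clipped matches: a hypothesis n-gram is credited at most as
        # many times as it occurs in the reference.
        matches = sum(min(count, ref[gram]) for gram, count in hyp.items())
        total = sum(hyp.values())
        if smooth and n > 1:
            matches += 1
            total += 1
        if matches == 0 or total == 0:
            # A single zero precision zeroes the whole geometric mean.
            return 0.0
        log_precision_sum += math.log(matches / total)
    # Brevity penalty for hypotheses shorter than the reference.
    if len(hypothesis) >= len(reference):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(reference) / len(hypothesis))
    # Geometric mean of the n-gram precisions, computed in log space.
    return brevity_penalty * math.exp(log_precision_sum / max_n)


hypothesis = "the cat is on the mat".split()
reference = "the cat sat on the mat".split()
unsmoothed = sentence_bleu(hypothesis, reference)
smoothed = sentence_bleu(hypothesis, reference, smooth=True)
print(unsmoothed, smoothed)
```

In this example the hypothesis differs from the reference by one word, yet it shares no 4-gram with it, so the unsmoothed score is 0.0 even though five of its six unigrams match. The smoothed variant returns a nonzero score, which is the kind of behavior smoothing is meant to provide for single sentences.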

Cite This Study

Chen et al. (2014) studied this question.

synapsesocial.com/papers/6a08d6213589fa5d64d5fdae https://doi.org/10.3115/v1/w14-3346

Also Consider

Synapse has enriched 5 closely related papers on similar questions. Consider them for comparative context:

1. Unpacking and Transforming Feature Functions: New Ways to Smooth Phrase Tables (2011) · 23 citations
2. BLEU (2001) · 21,551 citations
3. A simple and effective hierarchical phrase reordering model (2008) · 297 citations
4. Results of the WMT13 Metrics Shared Task (2013) · 57 citations
5. Batch Tuning Strategies for Statistical Machine Translation (2012) · 338 citations