January 1, 2017 | Open Access

MEANT 2.0: Accurate semantic MT evaluation for any output language

Abstract

We describe a new version of MEANT, which participated in the metrics task of the Second Conference on Machine Translation (WMT 2017). MEANT 2.0 uses idf-weighted distributional n-gram accuracy to determine the phrasal similarity of semantic role fillers and yields better correlations with human judgments of translation quality than earlier versions. The improved phrasal similarity enables a sub-version of MEANT to accurately evaluate translation adequacy for any output language, even languages without an automatic semantic parser. Our results show that MEANT, which is a non-ensemble and untrained metric, consistently performs as well as the top participants in previous years, including ensemble and trained ones, across different output languages. We also present timing statistics for MEANT to allow better estimation of the evaluation cost. MEANT 2.0 is open source and publicly available.
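
The abstract names idf-weighted distributional n-gram accuracy but does not give its formula. As a rough illustration only, the Python sketch below shows one way such a phrasal similarity could be computed. It is not the MEANT 2.0 implementation: the function names, the smoothed idf formula, the idf-weighted sum used to build an n-gram vector, the best-match alignment, and the F-score combination are all assumptions made for this example.

```python
import math
from collections import Counter


def idf_weights(corpus):
    """Smoothed inverse document frequency per token (assumed formula)."""
    n_docs = len(corpus)
    df = Counter(tok for sent in corpus for tok in set(sent))
    return {tok: math.log((1 + n_docs) / (1 + c)) + 1.0 for tok, c in df.items()}


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_vector(gram, vectors, idf):
    """idf-weighted sum of the word vectors in an n-gram (assumed composition)."""
    dim = len(next(iter(vectors.values())))
    out = [0.0] * dim
    for tok in gram:
        weight = idf.get(tok, 1.0)          # unseen tokens get weight 1.0
        vec = vectors.get(tok, [0.0] * dim)  # unknown words map to zeros
        out = [o + weight * x for o, x in zip(out, vec)]
    return out


def directed_accuracy(source, target, vectors, idf, n):
    """idf-weighted average of each source n-gram's best match in target."""
    src_grams, tgt_grams = ngrams(source, n), ngrams(target, n)
    if not src_grams or not tgt_grams:
        return 0.0
    tgt_vecs = [ngram_vector(g, vectors, idf) for g in tgt_grams]
    total = weight_sum = 0.0
    for gram in src_grams:
        gram_vec = ngram_vector(gram, vectors, idf)
        # Negative similarities are clipped to zero.
        best = max(0.0, max(cosine(gram_vec, tv) for tv in tgt_vecs))
        weight = sum(idf.get(tok, 1.0) for tok in gram)
        total += weight * best
        weight_sum += weight
    return total / weight_sum


def phrasal_similarity(hyp, ref, vectors, idf, n=1):
    """Harmonic mean of hypothesis-side and reference-side accuracy."""
    p = directed_accuracy(hyp, ref, vectors, idf, n)
    r = directed_accuracy(ref, hyp, vectors, idf, n)
    return 2 * p * r / (p + r) if p + r else 0.0
```

With hand-made toy vectors such as `{"cat": [1.0, 0.0], "feline": [0.9, 0.1]}`, calling `phrasal_similarity(["feline"], ["cat"], vectors, idf)` gives a score close to but below 1, whereas an exact-match metric would score the pair as 0. In practice the vectors would come from a trained word embedding model rather than from hand-written values.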
