January 1, 2019Open Access

Putting Evaluation in Context: Contextual Embeddings Improve Machine Translation Evaluation

NMNitika MathurGoogle (United States)TBTimothy BaldwinMohamed bin Zayed University of Artificial Intelligence TCTrevor CohnGoogle (United States)

Key Points

Key points are not available for this paper at this time.

Abstract

Accurate, automatic evaluation of machine translation is critical for system tuning, and evaluating progress in the field. We proposed a simple unsupervised metric, and additional supervised metrics which rely on contextual word embeddings to encode the translation and reference sentences. We find that these models rival or surpass all existing metrics in the WMT 2017 sentence-level and systemlevel tracks, and our trained model has a substantially higher correlation with human judgements than all existing metrics on the WMT 2017 to-English sentence level dataset.

Demander à l'IA

Bookmark

View Full Paper