Pushing the Limits of Translation Quality Estimation

André F.T. Martins; Marcin Junczys-Dowmunt; Fabio N. Kepler; Ramón Astudillo; Chris Hokamp; Roman Grundkiewicz

Vol. 5 (2017)

TACL approved

Pushing the Limits of Translation Quality Estimation

Published 2017-07-11

André F.T. Martins
Marcin Junczys-Dowmunt
Fabio N. Kepler
Ramón Astudillo
Chris Hokamp
Roman Grundkiewicz

André F.T. Martins
Unbabel Priberam Labs Instituto de Telecomunicacoes

Marcin Junczys-Dowmunt
Adam Mickiewicz University

Fabio N. Kepler
Unbabel

Ramón Astudillo
Unbabel

Chris Hokamp
Dublin City University

Roman Grundkiewicz
Adam Mickiewicz University

Abstract

Translation quality estimation is a task of growing importance in NLP, due to its potential to reduce post-editing human effort in disruptive ways. However, this potential is currently limited by the relatively low accuracy of existing systems. In this paper, we achieve remarkable improvements by exploiting synergies between the related tasks of word-level quality estimation and automatic post-editing. First, we stack a new, carefully engineered, neural model into a rich feature-based word-level quality estimation system. Then, we use the output of an automatic post-editing system as an extra feature, obtaining striking results on WMT16: a word-level F 1 MULT score of 57.47% (an absolute gain of +7.95% over the current state of the art), and a Pearson correlation score of 65.56% for sentence-level HTER prediction (an absolute gain of +13.36%).

PDF (presented at ACL 2017)