Modeling Past and Future for Neural Machine Translation

Zaixiang Zheng; Hao Zhou; Shujian Huang; Lili Mou; Xinyu Dai; Jiajun Chen; Zhaopeng Tu

Vol. 6 (2018)

TACL approved

Modeling Past and Future for Neural Machine Translation

Published 2018-03-11

Zaixiang Zheng
Hao Zhou
Shujian Huang
Lili Mou
Xinyu Dai
Jiajun Chen
Zhaopeng Tu

Zaixiang Zheng
Nanjing University

Hao Zhou
Toutiao AI Lab

Shujian Huang
Nanjing University

Lili Mou
University of Waterloo

Xinyu Dai
Nanjing University

Jiajun Chen
Nanjing University

Zhaopeng Tu
Tencent AI Lab

Abstract

Existing neural machine translation systems do not explicitly model what has been translated and what has not during the decoding phase. To address this problem, we propose a novel mechanism that separates the source information into two parts: translated PAST contents and untranslated FUTURE contents, which are modeled by two additional recurrent layers. The PAST and FUTURE contents are fed to both the attention model and the decoder states, which provides Neural Machine Translation (NMT) systems with the knowledge of translated and untranslated contents. Experimental results show that the proposed approach significantly improves the performance in Chinese-English, German-English, and English-German translation tasks. Specifically, the proposed model outperforms the conventional coverage model in terms of both the translation quality and the alignment error rate.

Article at MIT Press PDF (presented at ACL 2018)