Learning Structured Text Representations

Yang Liu; Mirella Lapata

Vol. 6 (2018)

TACL approved

Learning Structured Text Representations

Published 2018-01-31

Yang Liu
Mirella Lapata

Yang Liu
University of Edinburgh

Mirella Lapata
University of Edinburgh

Abstract

In this paper, we focus on learning structure-aware document representations from data without recourse to a discourse parser or additional annotations. Drawing inspiration from recent efforts to empower neural networks with a structural bias, we propose a model that can encode a document while automatically inducing rich structural dependencies. Specifically, we embed a differentiable non-projective parsing algorithm into a neural model and use attention mechanisms to incorporate the structural biases. Experimental evaluation across different tasks and datasets shows that the proposed model achieves state-of-the-art results on document modeling tasks while inducing intermediate structures which are both interpretable and meaningful.

Article at MIT Press PDF (presented at NAACL 2018)