Problems in Current Text Simplification Research: New Data Can Help
Published
2015-05-24
Wei Xu
,
Chris Callison-Burch
,
Courtney Napoles
Wei Xu
University of Pennsylvania
Chris Callison-Burch
University of Pennsylvania
Courtney Napoles
John Hopkins University
Abstract
Simple Wikipedia has dominated simplification research in the past 5 years. In this opinion paper, we argue that focusing on Wikipedia limits simplification research. We back up our arguments with corpus analysis and by highlighting statements that other researchers have made in the simplification literature. We introduce a new simplification dataset that is a significant improvement over Simple Wikipedia, and present a novel quantitative-comparative approach to study the quality of simplification data resources.
PDF (presented at EMNLP 2015)