Skip to main navigation menu Skip to main content Skip to site footer

Introduction to Mathematical Language Processing: Informal Proofs, Word Problems, and Supporting Tasks

Abstract

Automating discovery in mathematics and science will require sophisticated methods of information extraction and abstract reasoning, including models that can convincingly process relationships between mathematical elements and natural language, to produce problem solutions of real-world value. We analyze mathematical language processing methods across five strategic sub-areas (identifier-definition extraction, formula retrieval, natural language premise selection, math word problem solving, and informal theorem proving) in recent years, highlighting prevailing methodologies, existing limitations, overarching trends, and promising avenues for future research.

Article at MIT Press Presented at EMNLP 2023