Transparency Helps Reveal When Language Models Learn Meaning

Zhaofeng Wu; William Merrill; Hao Peng; Iz Beltagy; Noah Smith

Vol. 11 (2023)

TACL approved

Transparency Helps Reveal When Language Models Learn Meaning

Published 2023-07-24

Zhaofeng Wu
William Merrill
Hao Peng
Iz Beltagy
Noah Smith

Zhaofeng Wu
Massachusetts Institute of Technology

William Merrill
New York University

Hao Peng
Allen Institute for Artificial Intelligence

Iz Beltagy
Allen Institute for Artificial Intelligence

Noah Smith
University of Washington

Abstract

Many current NLP systems are built from language models trained to optimize unsupervised objectives on large amounts of raw text. Under what conditions might such a procedure acquire meaning? Our systematic experiments with synthetic data reveal that, with languages where all expressions have context-independent denotations (i.e., languages with strong transparency), both autoregressive and masked language models successfully learn to emulate semantic relations between expressions.

However, when denotations are changed to be context-dependent with the language otherwise unmodified, this ability degrades. Turning to natural language, our experiments with a specific phenomenon -- referential opacity -- add to the growing body of evidence that current language models do not represent natural language semantics well. We show this failure relates to the context-dependent nature of natural language form-meaning mappings.

Article at MIT Press Presented at ACL 2023