Exploring Practical Gaps in Using Cross Entropy to Implement Maximum Mutual Information Criterion for Rationalization

Wei Liu; Zhiying Deng; Zhongyu Niu; Jun Wang; Haozhao Wang; Ruixuan Li

Vol. 13 (2025)

TACL approved

Exploring Practical Gaps in Using Cross Entropy to Implement Maximum Mutual Information Criterion for Rationalization

Published 2025-12-25

Wei Liu
Zhiying Deng
Zhongyu Niu
Jun Wang
Haozhao Wang
Ruixuan Li

Wei Liu
School of Computer Science and Technology, Huazhong University of Science and Technology

Zhiying Deng
Central China Normal University

Zhongyu Niu
School of Computer Science and Technology, Huazhong University of Science and Technology

Jun Wang
iWudao.Tech

Haozhao Wang
School of Computer Science and Technology, Huazhong University of Science and Technology

Ruixuan Li

Abstract

Rationalization is a framework that aims to build self-explanatory NLP models by extracting a subset of human-intelligible pieces of their inputting texts. It involves a cooperative game where a selector selects the most human-intelligible parts of the input as the rationale, followed by a predictor that makes predictions based on these selected rationales. Existing literature uses the cross-entropy between the model's predictions and the ground-truth labels to measure the informativeness of the selected rationales, guiding the selector to choose better ones. In this study, we first theoretically analyze the objective of rationalization by decomposing it into two parts: the model-agnostic informativeness of the rationale candidates and the predictor's degree of fit. We then provide various empirical evidence to support that, under this framework, the selector tends to sample from a limited small region, causing the predictor to overfit these localized areas. This results in a significant mismatch between the cross-entropy objective and the informativeness of the rationale candidates, leading to suboptimal solutions. To address this issue, we propose a simple yet effective method that introduces random vicinal perturbations to the selected rationale candidates. This approach broadens the predictor's assessment to a vicinity around the selected rationale candidate. Compared to recent competitive methods, our method significantly improves rationale quality (by up to $6.6\%$) across six widely used classification datasets. Further experiments show that it can also generalize to the reading comprehension task and the fact extraction and verification task.

Presented at ACL 2025 Article at MIT Press