Explanation-Based Human Debugging of NLP Models: A Survey”

Piyawat Lertvittayakumjorn; Francesca Toni

Vol. 9 (2021)

TACL approved

Explanation-Based Human Debugging of NLP Models: A Survey”

Published 2021-12-30

Piyawat Lertvittayakumjorn
Francesca Toni

Piyawat Lertvittayakumjorn
Imperial College London

Francesca Toni
Imperial College London

Abstract

Debugging a machine learning model is hard since the bug usually involves the training data and the learning process. This becomes even harder for an opaque deep learning model if we have no clue about how the model actually works. In this survey, we review papers that exploit explanations to enable humans to give feedback and debug NLP models. We call this problem explanation-based human debugging (EBHD). In particular, we categorize and discuss existing work along three dimensions of EBHD (the bug context, the workflow, and the experimental setting), compile findings on how EBHD components affect the feedback providers, and highlight open problems that could be future research directions.

Article at MIT Press Presented at NAACL 2022