Explanation-Based Human Debugging of NLP Models: A Survey”
Published
2021-12-30
Piyawat Lertvittayakumjorn
,
Francesca Toni
Piyawat Lertvittayakumjorn
Imperial College London
Francesca Toni
Imperial College London
Abstract
Debugging a machine learning model is hard since the bug usually involves the training data and the learning process. This becomes even harder for an opaque deep learning model if we have no clue about how the model actually works. In this survey, we review papers that exploit explanations to enable humans to give feedback and debug NLP models. We call this problem explanation-based human debugging (EBHD). In particular, we categorize and discuss existing work along three dimensions of EBHD (the bug context, the workflow, and the experimental setting), compile findings on how EBHD components affect the feedback providers, and highlight open problems that could be future research directions.
Article at MIT Press
Presented at NAACL 2022