Machine-annotated Rationales: Faithfully Explaining Text Classification
Publication date
2021
Editors
Advisors
Supervisors
DOI
Document Type
Contribution to conference
Metadata
Show full item recordCollections
License
Abstract
We propose an approach to faithfully explaining text classification models, using a specifically designed neural network to find explanations in the form of machine-annotated rationales during the prediction process. This results in faithful explanations that are similar to human-annotated rationales, while not requiring human explanation examples during training. The quality of found explanations is measured on faithfulness, quantitative similarity to human explanations, and through a user evaluation.
Keywords
Citation
Herrewijnen, E, Nguyen, D, Mense, J & Bex, F 2021, 'Machine-annotated Rationales: Faithfully Explaining Text Classification', Paper presented at 35th AAAI Conference on Artificial Intelligence, 8/02/21 - 9/02/21., conference