Machine-annotated Rationales: Faithfully Explaining Text Classification

Publication date

2021

Authors

Herrewijnen, ElizeORCID 0000-0002-2729-6599ISNI 0000000523876731
Nguyen, DongISNI 0000000419527451
Mense, Jelte P.ISNI 0000000472320951
Bex, FlorisORCID 0000-0002-5699-9656ISNI 0000000118066508

Editors

Advisors

Supervisors

DOI

Document Type

Contribution to conference

License

Abstract

We propose an approach to faithfully explaining text classification models, using a specifically designed neural network to find explanations in the form of machine-annotated rationales during the prediction process. This results in faithful explanations that are similar to human-annotated rationales, while not requiring human explanation examples during training. The quality of found explanations is measured on faithfulness, quantitative similarity to human explanations, and through a user evaluation.

Keywords

Citation

Herrewijnen, E, Nguyen, D, Mense, J & Bex, F 2021, 'Machine-annotated Rationales: Faithfully Explaining Text Classification', Paper presented at 35th AAAI Conference on Artificial Intelligence, 8/02/21 - 9/02/21., conference