A Transparent Pipeline for Identifying Sexism in Social Media: Combining Explainability with Model Prediction
Publication date
2024-10
Document Type
Article
License
cc_by_nc_nd
Abstract
Featured Application: We show illustrative examples of sexist language to describe the taxonomy and explainability analysis.

In this study, we present a new approach that combines multiple Bidirectional Encoder Representations from Transformers (BERT) architectures with a Convolutional Neural Network (CNN) framework designed for fine-grained sexism detection in text. Our method uses Shapley Additive Explanations (SHAP) values to identify the terms that contribute most to sexist content. It defines a range of Sexism Scores based on both model predictions and explainability, moving beyond binary classification to provide a deeper understanding of the sexism-detection process. It also identifies specific parts of a sentence and their respective contributions to this range, which can be valuable for decision makers and future research. In conclusion, this study introduces an innovative method for enhancing the transparency of large language models (LLMs), which is particularly relevant in sensitive domains such as sexism detection. The incorporation of explainability into the model represents a significant advancement in this field. Our objective is to bridge the gap between advanced technology and human comprehension by providing a framework for creating AI models that are both efficient and transparent. This approach could serve as a pipeline for future studies to incorporate explainability into language models.
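To make the idea of per-term Shapley attribution concrete, the following is a minimal sketch and not the authors' pipeline. It computes exact Shapley values for a hypothetical toy scorer (`toy_model`, with made-up weights in `TOY_WEIGHTS`) that stands in for the BERT and CNN ensemble. The example sentence, the weights, and the interaction term are all assumptions chosen for illustration. The abstract does not state how predictions and SHAP values are combined into Sexism Scores, so that step is not reproduced here.

```python
from itertools import combinations
from math import factorial

# ASSUMPTION: made-up token weights; a stand-in for a trained classifier.
TOY_WEIGHTS = {"women": 0.1, "belong": 0.2, "kitchen": 0.3}


def toy_model(tokens):
    """Hypothetical scorer: returns a pseudo-probability in [0, 1]
    for the list of tokens that are present in the input."""
    score = sum(TOY_WEIGHTS.get(t, 0.0) for t in tokens)
    # ASSUMPTION: a simple interaction between two tokens, so that
    # attributions are not just the individual weights.
    if "belong" in tokens and "kitchen" in tokens:
        score += 0.2
    return min(score, 1.0)


def shapley_values(tokens, model):
    """Exact Shapley value of each token, enumerating all subsets of the
    remaining tokens. Assumes tokens are distinct; the cost grows as 2^n,
    so this is only feasible for short inputs."""
    n = len(tokens)
    values = {}
    for i, tok in enumerate(tokens):
        others = tokens[:i] + tokens[i + 1:]
        phi = 0.0
        for k in range(n):
            # Standard Shapley weight for coalitions of size k.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                with_tok = model(list(subset) + [tok])
                without_tok = model(list(subset))
                phi += weight * (with_tok - without_tok)
        values[tok] = phi
    return values


tokens = ["women", "belong", "in", "the", "kitchen"]
prediction = toy_model(tokens)
phi = shapley_values(tokens, toy_model)

# Rank tokens by their contribution to the prediction.
ranked = sorted(phi.items(), key=lambda item: item[1], reverse=True)
for tok, value in ranked:
    print(f"{tok:>8s}  {value:+.3f}")
```

In this sketch the attributions sum to the difference between the prediction for the full sentence and the prediction for an empty input, which is the efficiency property of Shapley values. Libraries such as SHAP approximate these values for real models, where exhaustive enumeration is not practical.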
Keywords
ensemble model, explainable AI (XAI), large language models (LLMs), natural language processing (NLP), sexism detection, Shapley values, General Materials Science, Instrumentation, General Engineering, Process Chemistry and Technology, Computer Science Applications, Fluid Flow and Transfer Processes, SDG 5 - Gender Equality
Citation
Mohammadi, H, Giachanou, A & Bagheri, A 2024, 'A Transparent Pipeline for Identifying Sexism in Social Media: Combining Explainability with Model Prediction', Applied Sciences (Switzerland), vol. 14, no. 19, 8620. https://doi.org/10.3390/app14198620