TY - JOUR
T1 - Explaining any black box model using real data
AU - Björklund, Anton
AU - Henelius, Andreas
AU - Oikarinen, Emilia
AU - Kallonen, Kimmo
AU - Puolamäki, Kai
PY - 2023/8/8
Y1 - 2023/8/8
N2 - In recent years the use of complex machine learning has increased drastically. These complex black box models trade interpretability for accuracy. The lack of interpretability is troubling for, e.g., socially sensitive, safety-critical, or knowledge extraction applications. In this paper, we propose a new explanation method, SLISE, for interpreting predictions from black box models. SLISE can be used with any black box model (model-agnostic), does not require any modifications to the black box model (post-hoc), and explains individual predictions (local). We evaluate our method using real-world datasets and compare it against other model-agnostic, local explanation methods. Our approach solves shortcomings in other related explanation methods by only using existing data instead of sampling new, artificial data. The method also generates more generalizable explanations and is usable without modification across various data domains.
AB - In recent years the use of complex machine learning has increased drastically. These complex black box models trade interpretability for accuracy. The lack of interpretability is troubling for, e.g., socially sensitive, safety-critical, or knowledge extraction applications. In this paper, we propose a new explanation method, SLISE, for interpreting predictions from black box models. SLISE can be used with any black box model (model-agnostic), does not require any modifications to the black box model (post-hoc), and explains individual predictions (local). We evaluate our method using real-world datasets and compare it against other model-agnostic, local explanation methods. Our approach solves shortcomings in other related explanation methods by only using existing data instead of sampling new, artificial data. The method also generates more generalizable explanations and is usable without modification across various data domains.
KW - 113 Computer and information sciences
KW - HCI (human-computer interaction)
KW - XAI (explainable artificial intelligence)
KW - Explanations
KW - Interpretability
KW - Interpretable machine learning
KW - Local explanation
KW - Model-agnostic explanation
U2 - 10.3389/fcomp.2023.1143904
DO - 10.3389/fcomp.2023.1143904
M3 - Article
SN - 2624-9898
VL - 5
JO - Frontiers in computer science
JF - Frontiers in computer science
M1 - 1143904
ER -