Re-interpreting Rules Interpretability
Trustworthy machine learning requires a high level of interpretability of machine learning models, yet many models are inherently black boxes. Training interpretable models instead—or using them to mimic the black-box model—seems like a viable solution. In practice, however, these interpretable models are still unintelligible due to their size and complexity. In this paper, we present an approach to explain the logic of large interpretable models that can be represented as sets of logical rules by a simple, and thus intelligible, descriptive model. The coarseness of this descriptive model and its fidelity to the original model can be controlled, so that a user can understand the original model in varying levels of depth. We showcase and discuss this approach on three real-world problems from healthcare, material science, and finance.
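To give a rough feel for the coarseness/fidelity trade-off the abstract describes, here is a minimal, self-contained Python sketch. It is a hypothetical illustration, not the authors' algorithm: it coarsens a toy ordered rule list by truncating every rule to its first k conditions, merges rules whose truncated conditions coincide (majority label), and measures fidelity as the fraction of sampled inputs on which the coarse list agrees with the original. All names (fires, predict, coarsen, fidelity) and the rule encoding are invented for this example.

```python
# Toy sketch (hypothetical, not the paper's method): coarsen a rule
# list and measure its fidelity to the original on random inputs.
import random
from collections import Counter

# A rule is (conditions, label); a condition is (feature, threshold, op).
def fires(conditions, x):
    return all(x[f] <= t if op == "<=" else x[f] > t for f, t, op in conditions)

def predict(rules, x, default=0):
    for conditions, label in rules:   # first matching rule wins
        if fires(conditions, x):
            return label
    return default

def coarsen(rules, k):
    """Truncate every rule to its first k conditions; rules that become
    identical are merged and assigned their majority label."""
    groups = {}
    for conditions, label in rules:
        groups.setdefault(tuple(conditions[:k]), []).append(label)
    return [(list(conds), Counter(labels).most_common(1)[0][0])
            for conds, labels in groups.items()]

def fidelity(original, coarse, samples):
    return sum(predict(original, x) == predict(coarse, x)
               for x in samples) / len(samples)

random.seed(0)
# 50 random two-condition rules over two features; thresholds on a coarse grid
rules = [([(0, round(random.random(), 1), "<="),
           (1, round(random.random(), 1), ">")], random.randint(0, 1))
         for _ in range(50)]
samples = [[random.random(), random.random()] for _ in range(1000)]
for k in (2, 1):   # smaller k: fewer, coarser rules, typically lower fidelity
    coarse = coarsen(rules, k)
    print(f"k={k}: {len(coarse)} rules, fidelity={fidelity(rules, coarse, samples):.2f}")
```

Varying k plays the role of the user-controlled coarseness knob: a smaller k yields a shorter, more intelligible rule list at the cost of fidelity to the original model.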
- Published in: International Journal of Data Science and Analytics
- Type: Article
- Authors: Adilova, Linara; Kamp, Michael; Andrienko, Gennady; Andrienko, Natalia
- Year: 2023
Citation information
Adilova, Linara; Kamp, Michael; Andrienko, Gennady; Andrienko, Natalia: Re-interpreting Rules Interpretability. International Journal of Data Science and Analytics, 2023. https://link.springer.com/article/10.1007/s41060-023-00398-5
@Article{Adilova.etal.2023a,
  author   = {Adilova, Linara and Kamp, Michael and Andrienko, Gennady and Andrienko, Natalia},
  title    = {Re-interpreting Rules Interpretability},
  journal  = {International Journal of Data Science and Analytics},
  url      = {https://link.springer.com/article/10.1007/s41060-023-00398-5},
  year     = {2023},
  abstract = {Trustworthy machine learning requires a high level of interpretability of machine learning models, yet many models are inherently black boxes. Training interpretable models instead—or using them to mimic the black-box model—seems like a viable solution. In practice, however, these interpretable models are still unintelligible due to their size and complexity. In this paper, we present an approach to explain the logic of large interpretable models that can be represented as sets of logical rules by a simple, and thus intelligible, descriptive model. The coarseness of this descriptive model and its fidelity to the original model can be controlled, so that a user can understand the original model in varying levels of depth. We showcase and discuss this approach on three real-world problems from healthcare, material science, and finance.}
}