Generation of Molecular Counterfactuals for Explainable Machine Learning Based on Core-Substituent Recombination
The use of black box machine learning models whose decisions cannot be understood limits the acceptance of predictions in interdisciplinary research and camouflages artificial learning characteristics leading to predictions for other than anticipated reasons. Consequently, there is increasing interest in explainable artificial intelligence to rationalize predictions and uncover potential pitfalls. Among others, relevant approaches include feature attribution methods to identify molecular structures determining predictions and counterfactuals (CFs) or contrastive explanations. CFs are defined as variants of test instances with minimal modifications leading to opposing predictions. In medicinal chemistry, CFs have thus far only been little investigated although they are particularly intuitive from a chemical perspective. We introduce a new methodology for the systematic generation of CFs that is centered on well-defined structural analogues of test compounds. The approach is transparent, computationally straightforward, and shown to provide a wealth of CFs for test sets. The method is made freely available.
- Veröffentlicht in:
ChemMedChem - Typ:
Article - Autoren:
Lamens, Alec; Bajorath, Jürgen - Jahr:
2024
Informationen zur Zitierung
Lamens, Alec; Bajorath, Jürgen: Generation of Molecular Counterfactuals for Explainable Machine Learning Based on Core-Substituent Recombination, ChemMedChem, 2024, 19, February, https://chemistry-europe.onlinelibrary.wiley.com/doi/full/10.1002/cmdc.202300586, Lamens.Bajorath.2024a,
@Article{Lamens.Bajorath.2024a,
author={Lamens, Alec; Bajorath, Jürgen},
title={Generation of Molecular Counterfactuals for Explainable Machine Learning Based on Core-Substituent Recombination},
journal={ChemMedChem},
volume={19},
month={February},
url={https://chemistry-europe.onlinelibrary.wiley.com/doi/full/10.1002/cmdc.202300586},
year={2024},
abstract={The use of black box machine learning models whose decisions cannot be understood limits the acceptance of predictions in interdisciplinary research and camouflages artificial learning characteristics leading to predictions for other than anticipated reasons. Consequently, there is increasing interest in explainable artificial intelligence to rationalize predictions and uncover potential...}}