Generation of Molecular Counterfactuals for Explainable Machine Learning Based on Core-Substituent Recombination

The use of black box machine learning models whose decisions cannot be understood limits the acceptance of predictions in interdisciplinary research and camouflages artificial learning characteristics leading to predictions for other than anticipated reasons. Consequently, there is increasing interest in explainable artificial intelligence to rationalize predictions and uncover potential pitfalls. Among others, relevant approaches include feature attribution methods to identify molecular structures determining predictions and counterfactuals (CFs) or contrastive explanations. CFs are defined as variants of test instances with minimal modifications leading to opposing predictions. In medicinal chemistry, CFs have thus far only been little investigated although they are particularly intuitive from a chemical perspective. We introduce a new methodology for the systematic generation of CFs that is centered on well-defined structural analogues of test compounds. The approach is transparent, computationally straightforward, and shown to provide a wealth of CFs for test sets. The method is made freely available.

  • Published in:
    ChemMedChem
  • Type:
    Article
  • Authors:
    Lamens, Alec; Bajorath, Jürgen
  • Year:
    2024

Citation information

Lamens, Alec; Bajorath, Jürgen: Generation of Molecular Counterfactuals for Explainable Machine Learning Based on Core-Substituent Recombination, ChemMedChem, 2024, 19, February, https://chemistry-europe.onlinelibrary.wiley.com/doi/full/10.1002/cmdc.202300586, Lamens.Bajorath.2024a,

Associated Lamarr Researchers

lamarr institute person Bajorath Juergen - Lamarr Institute for Machine Learning (ML) and Artificial Intelligence (AI)

Prof. Dr. Jürgen Bajorath

Area Chair Life Sciences to the profile