Learning characteristics of graph neural networks predicting protein–ligand affinities
In drug design, compound potency prediction is a popular machine learning application. Graph neural networks (GNNs) predict ligand affinity from graph representations of protein–ligand interactions typically extracted from X-ray structures. Despite some promising findings leading to claims that GNNs can learn details of protein–ligand interactions, such predictions are also controversially viewed. For example, evidence has been presented that GNNs might not learn protein–ligand interactions but memorize ligand and protein training data instead. We have carried out affinity predictions with six GNN architectures on community-standard datasets and rationalized the predictions using explainable artificial intelligence. The results confirm a strong influence of ligand—but not protein—memorization during GNN learning and also show that some GNN architectures increasingly prioritize interaction information for predicting high affinities. Thus, while GNNs do not comprehensively account for protein–ligand interactions and physical reality, depending on the model, they balance ligand memorization with learning of interaction patterns.
- Published in:
Nature Machine Intelligence - Type:
Article - Authors:
Mastropietro, Andrea; Pasculli, Giuseppe; Bajorath, Jürgen - Year:
2023 - Source:
https://www.nature.com/articles/s42256-023-00756-9
Citation information
Mastropietro, Andrea; Pasculli, Giuseppe; Bajorath, Jürgen: Learning characteristics of graph neural networks predicting protein–ligand affinities, Nature Machine Intelligence, 2023, 5, 1427--1436, November, https://www.nature.com/articles/s42256-023-00756-9, Mastropietro.etal.2023a,
@Article{Mastropietro.etal.2023a,
author={Mastropietro, Andrea; Pasculli, Giuseppe; Bajorath, Jürgen},
title={Learning characteristics of graph neural networks predicting protein–ligand affinities},
journal={Nature Machine Intelligence},
volume={5},
pages={1427--1436},
month={November},
url={https://www.nature.com/articles/s42256-023-00756-9},
year={2023},
abstract={In drug design, compound potency prediction is a popular machine learning application. Graph neural networks (GNNs) predict ligand affinity from graph representations of protein–ligand interactions typically extracted from X-ray structures. Despite some promising findings leading to claims that GNNs can learn details of protein–ligand interactions, such predictions are also controversially...}}