Comparative evaluation of methods for the prediction of protein–ligand binding sites

Abstract The accurate identification of protein–ligand binding sites is of critical importance in understanding and modulating protein function. Accordingly, ligand binding site prediction has remained a research focus for over three decades with over 50 methods developed and a change of paradigm fr...

Full description

Saved in:

Bibliographic Details
Main Authors:	Javier S. Utgés, Geoffrey J. Barton
Format:	Article
Language:	English
Published:	BMC 2024-11-01
Series:	Journal of Cheminformatics
Subjects:	Ligand binding site prediction Binding pocket Benchmark Reference dataset Machine learning Drug discovery
Online Access:	https://doi.org/10.1186/s13321-024-00923-z
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846164937793077248
author	Javier S. Utgés Geoffrey J. Barton
author_facet	Javier S. Utgés Geoffrey J. Barton
author_sort	Javier S. Utgés
collection	DOAJ
description	Abstract The accurate identification of protein–ligand binding sites is of critical importance in understanding and modulating protein function. Accordingly, ligand binding site prediction has remained a research focus for over three decades with over 50 methods developed and a change of paradigm from geometry-based to machine learning. In this work, we collate 13 ligand binding site predictors, spanning 30 years, focusing on the latest machine learning-based methods such as VN-EGNN, IF-SitePred, GrASP, PUResNet, and DeepPocket and compare them to the established P2Rank, PRANK and fpocket and earlier methods like PocketFinder, Ligsite and Surfnet. We benchmark the methods against the human subset of our new curated reference dataset, LIGYSIS. LIGYSIS is a comprehensive protein–ligand complex dataset comprising 30,000 proteins with bound ligands which aggregates biologically relevant unique protein–ligand interfaces across biological units of multiple structures from the same protein. LIGYSIS is an improvement for testing methods over earlier datasets like sc-PDB, PDBbind, binding MOAD, COACH420 and HOLO4K which either include 1:1 protein–ligand complexes or consider asymmetric units. Re-scoring of fpocket predictions by PRANK and DeepPocket display the highest recall (60%) whilst IF-SitePred presents the lowest recall (39%). We demonstrate the detrimental effect that redundant prediction of binding sites has on performance as well as the beneficial impact of stronger pocket scoring schemes, with improvements up to 14% in recall (IF-SitePred) and 30% in precision (Surfnet). Finally, we propose top-N+2 recall as the universal benchmark metric for ligand binding site prediction and urge authors to share not only the source code of their methods, but also of their benchmark. Scientific contributions This study conducts the largest benchmark of ligand binding site prediction methods to date, comparing 13 original methods and 15 variants using 10 informative metrics. The LIGYSIS dataset is introduced, which aggregates biologically relevant protein–ligand interfaces across multiple structures of the same protein. The study highlights the detrimental effect of redundant binding site prediction and demonstrates significant improvement in recall and precision through stronger scoring schemes. Finally, top-N+2 recall is proposed as a universal benchmark metric for ligand binding site prediction, with a recommendation for open-source sharing of both methods and benchmarks.
format	Article
id	doaj-art-e9a09ea13a8f4837bbfb816b6e811d8e
institution	Kabale University
issn	1758-2946
language	English
publishDate	2024-11-01
publisher	BMC
record_format	Article
series	Journal of Cheminformatics
spelling	doaj-art-e9a09ea13a8f4837bbfb816b6e811d8e2024-11-17T12:45:47ZengBMCJournal of Cheminformatics1758-29462024-11-0116113510.1186/s13321-024-00923-zComparative evaluation of methods for the prediction of protein–ligand binding sitesJavier S. Utgés0Geoffrey J. Barton1Division of Computational Biology, School of Life Sciences, University of DundeeDivision of Computational Biology, School of Life Sciences, University of DundeeAbstract The accurate identification of protein–ligand binding sites is of critical importance in understanding and modulating protein function. Accordingly, ligand binding site prediction has remained a research focus for over three decades with over 50 methods developed and a change of paradigm from geometry-based to machine learning. In this work, we collate 13 ligand binding site predictors, spanning 30 years, focusing on the latest machine learning-based methods such as VN-EGNN, IF-SitePred, GrASP, PUResNet, and DeepPocket and compare them to the established P2Rank, PRANK and fpocket and earlier methods like PocketFinder, Ligsite and Surfnet. We benchmark the methods against the human subset of our new curated reference dataset, LIGYSIS. LIGYSIS is a comprehensive protein–ligand complex dataset comprising 30,000 proteins with bound ligands which aggregates biologically relevant unique protein–ligand interfaces across biological units of multiple structures from the same protein. LIGYSIS is an improvement for testing methods over earlier datasets like sc-PDB, PDBbind, binding MOAD, COACH420 and HOLO4K which either include 1:1 protein–ligand complexes or consider asymmetric units. Re-scoring of fpocket predictions by PRANK and DeepPocket display the highest recall (60%) whilst IF-SitePred presents the lowest recall (39%). We demonstrate the detrimental effect that redundant prediction of binding sites has on performance as well as the beneficial impact of stronger pocket scoring schemes, with improvements up to 14% in recall (IF-SitePred) and 30% in precision (Surfnet). Finally, we propose top-N+2 recall as the universal benchmark metric for ligand binding site prediction and urge authors to share not only the source code of their methods, but also of their benchmark. Scientific contributions This study conducts the largest benchmark of ligand binding site prediction methods to date, comparing 13 original methods and 15 variants using 10 informative metrics. The LIGYSIS dataset is introduced, which aggregates biologically relevant protein–ligand interfaces across multiple structures of the same protein. The study highlights the detrimental effect of redundant binding site prediction and demonstrates significant improvement in recall and precision through stronger scoring schemes. Finally, top-N+2 recall is proposed as a universal benchmark metric for ligand binding site prediction, with a recommendation for open-source sharing of both methods and benchmarks.https://doi.org/10.1186/s13321-024-00923-zLigand binding site predictionBinding pocketBenchmarkReference datasetMachine learningDrug discovery
spellingShingle	Javier S. Utgés Geoffrey J. Barton Comparative evaluation of methods for the prediction of protein–ligand binding sites Journal of Cheminformatics Ligand binding site prediction Binding pocket Benchmark Reference dataset Machine learning Drug discovery
title	Comparative evaluation of methods for the prediction of protein–ligand binding sites
title_full	Comparative evaluation of methods for the prediction of protein–ligand binding sites
title_fullStr	Comparative evaluation of methods for the prediction of protein–ligand binding sites
title_full_unstemmed	Comparative evaluation of methods for the prediction of protein–ligand binding sites
title_short	Comparative evaluation of methods for the prediction of protein–ligand binding sites
title_sort	comparative evaluation of methods for the prediction of protein ligand binding sites
topic	Ligand binding site prediction Binding pocket Benchmark Reference dataset Machine learning Drug discovery
url	https://doi.org/10.1186/s13321-024-00923-z
work_keys_str_mv	AT javiersutges comparativeevaluationofmethodsforthepredictionofproteinligandbindingsites AT geoffreyjbarton comparativeevaluationofmethodsforthepredictionofproteinligandbindingsites

Comparative evaluation of methods for the prediction of protein–ligand binding sites

Similar Items