The impact of curation errors in the PDBBind Database on machine learning predictions of protein-protein binding affinity
Database (Oxford) 2025-11-25
Summary:
The PDBBind database has been widely utilized for the computational prediction of protein-protein binding affinities. While the accuracy of the PDBBind-curated equilibrium dissociation constants (KD) has been reported for the protein-ligand subset of the PDBBind database, the curation accuracy has not been reported for the protein-protein subset. Here, we present a detailed manual analysis for the subset of PDBBind records with PubMed Central Open Access primary publications and find that ~19%...