iSEE: Interface Structure, Evolution and Energy-based machine learning predictor of binding affinity changes upon mutations

Publication date

2019-02

Authors

Geng, Cunliang (ISNI 000000050599841X)
Vangone, Anna (ISNI 0000000506294764)
Folkers, G.E. (ISNI 0000000390350786)
Xue, Li (ISNI 0000000506297551)
Bonvin, A.M.J.J. (ORCID 0000-0001-7369-1322, ISNI 0000000396501354)

Document Type

Article
Open Access

Abstract

Quantitative evaluation of binding affinity changes upon mutations is crucial for protein engineering and drug design. Machine learning-based methods are gaining increasing momentum in this field. Because experimental data are limited, using a small number of sensitive predictive features is vital to the generalization and robustness of such machine learning methods. Here we introduce a fast and reliable predictor of binding affinity changes upon single point mutation, based on a random forest approach. Our method, iSEE, uses a limited number of interface Structure, Evolution and Energy-based features for the prediction. Using only 31 features, iSEE achieves a high prediction performance, with a Pearson correlation coefficient (PCC) of 0.80 and a root mean square error of 1.41 kcal mol⁻¹ on a diverse training dataset consisting of 1102 mutations in 57 protein-protein complexes. It competes with existing state-of-the-art methods on two blind test datasets. Predictions for a new dataset of 540 mutations in 58 protein complexes from the recently published SKEMPI 2.0 database reveal that none of the current methods performs well (PCC < 0.4), although their combination does improve the predictions. Feature analysis for iSEE underlines the significance of evolutionary conservation for quantitative prediction of mutation effects. As an application example, we perform a full mutation scanning of the interface residues in the MDM2-p53 complex.
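
The two performance measures quoted in the abstract, the Pearson correlation coefficient (PCC) and the root mean square error (RMSE), can be illustrated with a minimal Python sketch. This is not the iSEE code: the function names `pearson_cc` and `rmse` are hypothetical, and the ΔΔG values are made up purely for illustration.

```python
import math


def pearson_cc(predicted, observed):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(predicted)
    if n != len(observed) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_p = sum(predicted) / n
    mean_o = sum(observed) / n
    # Covariance and variances (the 1/n factors cancel in the ratio).
    cov = sum((p - mean_p) * (o - mean_o) for p, o in zip(predicted, observed))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_o = sum((o - mean_o) ** 2 for o in observed)
    if var_p == 0 or var_o == 0:
        raise ValueError("PCC is undefined for a constant sequence")
    return cov / math.sqrt(var_p * var_o)


def rmse(predicted, observed):
    """Root mean square error, in the same units as the inputs."""
    n = len(predicted)
    if n != len(observed) or n == 0:
        raise ValueError("need two non-empty sequences of equal length")
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(predicted, observed)) / n)


if __name__ == "__main__":
    # Hypothetical binding affinity changes in kcal/mol (illustrative only,
    # not taken from the iSEE datasets).
    observed = [0.5, 1.2, -0.3, 2.1, 0.0]
    predicted = [0.7, 0.9, 0.1, 1.8, 0.4]
    print(f"PCC  = {pearson_cc(predicted, observed):.2f}")
    print(f"RMSE = {rmse(predicted, observed):.2f} kcal/mol")
```

PCC measures how well predicted and experimental values co-vary, independent of scale, while RMSE reports the typical absolute error in kcal/mol. Reporting both, as the abstract does, guards against a predictor that ranks mutations well but is systematically offset.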

Keywords

binding affinity, full mutation scanning, machine learning, protein–protein interactions, single point mutation, SDG 15 - Life on Land

Citation

Geng, C, Vangone, A, Folkers, G E, Xue, L C & Bonvin, A M J J 2019, 'iSEE: Interface Structure, Evolution and Energy-based machine learning predictor of binding affinity changes upon mutations', Proteins: Structure, Function and Genetics, vol. 87, no. 2, pp. 110-119. https://doi.org/10.1002/prot.25630