ARCTIC-3D: Automatic Retrieval and ClusTering of Interfaces in Complexes from 3D structural information

Publication date

2023-07-11

Authors

Giulini, MarcoISNI 000000052348341X
Vargas Honorato, Rodrigo VISNI 0000000492959592
Rivera, Jesus L.
Bonvin, Alexandre M J JORCID 0000-0001-7369-1322ISNI 0000000396501354

Editors

Advisors

Supervisors

Document Type

/dk/atira/pure/researchoutput/researchoutputtypes/workingpaper/preprint
Open Access logo

License

cc_by_nc_nd

Abstract

The formation of a stable complex between proteins lies at the core of a wide variety of biological processes and has been the focus of countless experiments. The huge amount of information contained in the protein structural interactome in the Protein Data Bank can now be used to characterise and classify the existing biological interfaces. We here introduce ARCTIC-3D, a fast and user-friendly data mining and clustering software to retrieve data and rationalise the interface information associated with the protein input data. We demonstrate its use by various examples ranging from showing the increased interaction complexity of eukaryotic proteins, 20% of which on average have more than 3 different interfaces compared to only 10% for prokaryotes, to associating different functions to different interfaces. In the context of modelling biomolecular assemblies, we introduce the concept of “recognition entropy”, related to the number of possible interfaces of the components of a protein-protein complex, which we demonstrate to correlate with the modelling difficulty. The identified interface clusters can also be used to generate various combinations of interface-specific restraints for integrative modelling. The ARCTIC-3D software is freely available at https://github.com/haddocking/arctic3d and can be accessed as a web-service at https://wenmr.science.uu.nl/arctic-3d

Keywords

Citation

Giulini, M, Honorato, R V, Rivera, J L & Bonvin, A M J J 2023 'ARCTIC-3D : Automatic Retrieval and ClusTering of Interfaces in Complexes from 3D structural information' bioRxiv. https://doi.org/10.1101/2023.07.10.548477