open access publication

Preprint, 2024

The power and limits of predicting exon-exon interactions using protein 3D structures

bioRxiv, Page 2024.03.01.582917, 10.1101/2024.03.01.582917

Contributors

Liebold, Jeanine 0009-0002-0797-1806 [1] Del Moral-Morales, Aylin 0000-0003-3985-628X [1] [2] Manalastas-Cantos, Karen G 0000-0002-5483-9616 [1] [3] Tsoy, Olga V 0000-0002-7592-2080 [1] Kurtz, Stefan 0000-0001-5783-0054 [1] Baumbach, Jan 0000-0002-0282-0462 (Corresponding author) [1] [4] Newaz, Khalique 0000-0002-1192-8360 (Corresponding author) [1]

Affiliations

  1. [1] Universität Hamburg
  2. [NORA names: Germany; Europe, EU; OECD];
  3. [2] Universidad Autónoma Metropolitana
  4. [NORA names: Mexico; America, Central; OECD];
  5. [3] Centre for Structural Systems Biology
  6. [NORA names: Germany; Europe, EU; OECD];
  7. [4] University of Southern Denmark
  8. [NORA names: SDU University of Southern Denmark; University; Denmark; Europe, EU; Nordic; OECD]

Abstract

Abstract Alternative splicing (AS) effects on cellular functions can be captured by studying changes in the underlying protein-protein interactions (PPIs). Because AS results in the gain or loss of exons, existing methods for predicting AS-related PPI changes utilize known PPI interfacing exon-exon interactions (EEIs), which only cover ∼5% of known human PPIs. Hence, there is a need to extend the existing limited EEI knowledge to advance the functional understanding of AS. In this study, we explore whether existing computational PPI interface prediction (PPIIP) methods, originally designed to predict residue-residue interactions (RRIs), can be used to predict EEIs. We evaluate three recent state-of-the-art PPIIP methods for the RRI- as well as EEI-prediction tasks using known protein complex structures, covering ∼230,000 RRIs and ∼27,000 EEIs. Our results provide the first evidence that existing PPIIP methods can be extended for the EEI prediction task, showing F-score, precision, and recall performances of up to ∼38%, ∼63%, and ∼28%, respectively, with a false discovery rate of less than 5%. Our study provides insights into the power and limits of existing PPIIP methods to predict EEIs, thus guiding future developments of computational methods for the EEI prediction task. We provide streamlined computational pipelines integrating each of the three considered PPIIP methods for the EEI prediction task to be utilized by the scientific community.

Keywords

AS results, Abstract Alternative splicing, F-score, PPI changes, RRI, alternative splicing, cellular functions, changes, community, complex structure, computational methods, computational pipeline, development, development of computational methods, discovery rate, effect, evidence, exon, false discovery rate, function, functional understanding, gain, human protein-protein interaction, interaction, interface prediction, knowledge, limitations, loss, loss of exons, method, performance, pipeline, power, precision, prediction, prediction task, protein, protein 3D structures, protein complex structures, protein-protein interactions, rate, recall performance, residue-residue interactions, results, scientific community, splicing, state-of-the-art, structure, study, task, understanding

Funders

  • Federal Ministry of Education and Research
  • German Academic Exchange Service
  • Consejo Nacional de Humanidades, Ciencias y Tecnologías

Data Provider: Digital Science