open access publication

Preprint, 2024

How to deal with internal fragment ions?

bioRxiv, Page 2024.04.18.589916, 10.1101/2024.04.18.589916

Contributors

Grimaud, Arthur 0000-0002-6438-2402 (Corresponding author) [1] Babović, Maša [1] Holck, Frederik Haugaard [1] Jensen, Ole Noerregaard 0000-0003-1862-8528 [1] Schwämmle, Veit Stefan 0000-0002-9708-6722 [1]

Affiliations

  1. [1] University of Southern Denmark
  2. [NORA names: SDU University of Southern Denmark; University; Denmark; Europe, EU; Nordic; OECD]

Abstract

1 Abstract Tandem mass spectrometry of peptides and proteins generates mass spectra of their gas-phase fragmentation product ions, including N-terminal, C-terminal, and internal fragment ions. Whereas N- and C-terminal ions are routinely assigned and identified using computational methods, internal fragment ions are often difficult to annotate correctly. They become particularly relevant for long peptides and full proteoforms where the peptide backbone is more likely to be fragmented multiple times. Internal fragment ions potentially offer tremendous information regarding amino acid sequences and positions of post-translational modifications of peptides and intact proteins. However, their practical application is challenged by the vast number of theoretical internal fragments that exist for long amino acid sequences, leading to a high risk of false-positive annotations. We analyze the mass spectral contributions of internal fragment ions in spectra from middle-down and top-down experiments and introduce a novel graph-based annotation approach designed to manage the complexity of internal fragments. Our graph-based representation allows us to compare multiple candidate proteoforms in a single graph, and to assess different candidate annotations in a fragment ion spectrum. We demonstrate cases from middle-down and top-down data where internal ions enhance amino acid sequence coverage of polypeptides and proteins and accurate localization of post-translational modifications. We conclude that our graph-based method provides a general approach to process complex tandem mass spectra, enhance annotation of internal fragment ions, and improve proteoform sequencing and characterization by mass spectrometry.

Keywords

Abstract, C-terminal, C-terminal ions, N-, N-terminal, accurate localization, acid sequence, amino, amino acid sequence, amino acid sequence coverage, annotation, annotation approach, applications, approach, backbone, cases, characterization, complex, computational methods, data, experiments, fragment ion spectra, fragment ions, fragmentation product ions, fragments, graph, graph-based methods, graph-based representation, information, intact protein, internal fragment, internal fragment ions, internal ions, ion spectra, ions, localization of post-translational modifications, long amino acid sequences, long peptides, mass spectra, mass spectrometry, mass spectrometry of peptides, method, middle-down, modification, modification of peptides, multiple times, peptide, peptide backbone, polypeptide, position, post-translational modification of peptides, post-translational modifications, product ions, protein, proteoforms, representation, risk, sequence, single graph, spectra, spectrometry, tandem mass spectrometry of peptides, time, top-down data, top-down experiments

Funders

  • Novo Nordisk Foundation
  • European Commission
  • The Velux Foundations

Data Provider: Digital Science