open access publication

Preprint, 2024

One Hundred Years of Influenza A Evolution

bioRxiv, Page 2024.02.27.582392, 10.1101/2024.02.27.582392

Contributors

Nielsen, Bjarke Frost 0000-0002-3868-9187 (Corresponding author) [1] [2] [3] Berrig, Christian 0000-0001-9427-355X [2] Grenfell, Bryan Thomas 0000-0003-3227-5909 [3] Andreasen, Viggo 0000-0001-8152-3898 [2]

Affiliations

  1. [1] University of Copenhagen
  2. [NORA names: KU University of Copenhagen; University; Denmark; Europe, EU; Nordic; OECD];
  3. [2] Roskilde University
  4. [NORA names: RUC Roskilde University; University; Denmark; Europe, EU; Nordic; OECD];
  5. [3] Princeton University
  6. [NORA names: United States; America, North; OECD]

Abstract

Leveraging the simplicity of raw nucleotide distances, we provide an intuitive window into the evolution of the human influenza A ‘nonstructural’ (NS) gene. In an analysis suggested by the eminent Danish biologist Freddy B. Christiansen, we illustrate the existence of a continuous genetic “backbone” of influenza A NS genes, steadily increasing in distance to the 1918 root over more than a century. Interestingly, the 2009 influenza pandemic represents a clear departure from this enduring genetic backbone. Utilizing nucleotide distance maps and phylogenetic analyses, we illustrate remaining uncertainties regarding the origin of the 2009 pandemic, highlighting the complexity of influenza evolution. The NS gene is interesting precisely because it experiences neutral genetic drift over long periods of time time, while sudden deviations from this drift pattern can indicate changes in other genes via the hitchhiking effect. Our approach employs two measures based on genotypic distance — the rooted temporal Hamming map and the unrooted temporal Hamming distribution — to analyze the evolutionary dynamics of the NS gene. The rooted Hamming map elucidates distances between a reference sequence and all other sequences over time. In contrast, the unrooted temporal Hamming distribution captures the distribution of genotypic distances between simultaneously circulating viruses, thereby revealing patterns of sequence diversity and epi-evolutionary dynamics. Our study aims to supplement traditional tree-based phylogenetic inference with these direct temporal distance-based measures, offering transparent insights into the evolution of the influenza NS gene.

Keywords

Hamming, NS gene, analysis, backbone, changes, complex, deviation, distance, distance map, distance-based measures, distribution, diversity, drift, drift patterns, dynamics, effect, evolution, evolutionary dynamics, genes, genetic backbone, genetic drift, genotypic distance, hitchhiking, hitchhiking effect, inference, influenza, influenza evolution, influenza pandemic, insights, long periods, maps, measurements, neutral genetic drift, nonstructure, nucleotide, nucleotide distance, origin, pandemic, patterns, patterns of sequence diversity, phylogenetic analysis, phylogenetic inference, reference, reference sequence, root, sequence, sequence diversity, simultaneously, study, sudden deviation, time, time time, transparent insight, uncertainty, virus, window, years

Funders

  • Danish National Research Foundation
  • United States Department of Veterans Affairs
  • Carlsberg Foundation

Data Provider: Digital Science