open access publication

Preprint, 2024

Insights into gene conversion and crossing-over processes from long-read sequencing of human, chimpanzee and gorilla testes and sperm

bioRxiv, Page 2024.07.05.601967, 10.1101/2024.07.05.601967

Contributors

Porsborg, Peter Soerud (Corresponding author) [1] Charmouh, Anders Poulsen [1] Singh, Vinod Kumar [1] [2] Winge, Sofia Boeg 0000-0003-1666-1228 [3] [4] Hvilsom, Christina 0000-0001-7870-6888 [5] Pelizzola, Marta 0000-0001-6909-2335 [1] Laurentino, Sandra S 0000-0002-5213-2756 [6] Neuhaus, Nina 0000-0003-0181-6194 [6] Hobolth, Asger 0000-0003-4056-1286 [1] Bataillon, Thomas Martin Jean 0000-0002-4730-2538 [1] Almstrup, Kristian 0000-0002-1832-0307 [3] [4] [7] Besenbacher, Søren 0000-0003-1455-1738 [1] [2] Schierup, Mikkel Heide 0000-0002-5028-1790 (Corresponding author) [1]

Affiliations

  1. [1] Aarhus University
  2. [NORA names: AU Aarhus University; University; Denmark; Europe, EU; Nordic; OECD];
  3. [2] Aarhus University Hospital
  4. [NORA names: Central Denmark Region; Hospital; Denmark; Europe, EU; Nordic; OECD];
  5. [3] International Center for Research and Research Training in Endocrine Disruption of Male Reproduction and Child Health, Copenhagen, 2100, Denmark
  6. [NORA names: Denmark; Europe, EU; Nordic; OECD];
  7. [4] Rigshospitalet
  8. [NORA names: Capital Region of Denmark; Hospital; Denmark; Europe, EU; Nordic; OECD];
  9. [5] Copenhagen Zoo
  10. [NORA names: Miscellaneous; Denmark; Europe, EU; Nordic; OECD];

Abstract

Abstract Homologous recombination rearranges genetic information during meiosis to generate new combinations of variants. Recombination also causes new mutations, affects the GC content of the genome and reduces selective interference. Here, we use HiFi long-read sequencing to directly detect crossover and gene conversion events from switches between the two haplotypes along single HiFi-reads from testis tissue of humans, chimpanzees and gorillas as well as human sperm samples. Furthermore, based on DNA methylation calls, we classify the cellular origin of reads to either somatic or germline cells in the testis tissue. We identify 1692 crossovers and 1032 gene conversions in nine samples and investigate their chromosomal distribution. Crossovers are more telomeric and correlate better with recombination maps than gene conversions. We show a strong concordance between a human double-strand break map and the human samples, but not for the other species, supporting different PRDM9-programmed double-strand break loci. We estimate the average gene conversion tract lengths to be similar and very short in all three species (means 40-100 bp, fitted well by a geometric distribution) and that 95-98% of non-crossover events do not involve tracts intersecting with polymorphism and are therefore not detectable. Finally, we detect a GC bias in the gene conversion of both single and multiple SNVs and show that the GC-biased gene conversion affects SNVs flanking crossover events. This implies that gene conversion events associated with crossover events are much longer (estimated above 500 bp) than those associated with non-crossover events. Highly accurate long-read sequencing combined with the classification of reads to specific cell types provides a new, powerful way to make individual, detailed maps of gene conversion and crossovers for any species.

Keywords

Abstract Homologous recombination, DNA, GC bias, GC content, GC-biased gene conversion, HIFI, HiFi long-read sequencing, SNVs, accurate long-read sequencing, average gene conversion tract length, bias, calls, cell types, cells, cellular origin, chimpanzees, chromosomal distribution, classification, combination, combination of variants, concordance, content, conversion, conversion events, conversion tract length, crossing-over processes, crossover, crossover events, detect crossovers, distribution, events, gene conversion, gene conversion events, gene conversion tract lengths, genes, genome, germline, germline cells, gorillas, haplotypes, homologous recombination, human samples, human sperm samples, humans, interference, length, loci, long-read sequencing, maps, meiosis, methylation calls, mutations, non-crossover events, origin, polymorphism, process, recombination, recombination maps, reducing selective interference, samples, selective interference, sequence, species, sperm, sperm samples, switching, testis, testis tissue, tissue, tissues of humans, tract, tract length, type, variants

Funders

  • European Cooperation in Science and Technology
  • Novo Nordisk Foundation

Data Provider: Digital Science