open access publication

Article, 2021

The genome of Magnolia biondii Pamp. provides insights into the evolution of Magnoliales and biosynthesis of terpenoids

Horticulture Research, ISSN 2662-6810, 2052-7276, Volume 8, 1, Pages 1-13, 10.1038/s41438-021-00471-9

Contributors

Dong, Shan-Shan 0000-0002-2557-3361 [1] Liu, Min [2] Liu, Yang 0000-0002-5942-839X [1] [2] Chen, Fei 0000-0002-4346-6906 [3] Yang, Ting 0000-0002-2872-4954 [2] Chen, Lu [1] Zhang, Xingtan 0000-0002-5207-0882 [4] Guo, Xing [2] Fang, Dongming [2] Li, Linzhou 0000-0003-0799-0900 [2] Deng, Tian [1] Yao, Zhang-Xiu [1] Lang, Xiao-An [1] Gong, Yi-Qing [1] Wu, Ernest T Y 0000-0001-8800-8805 [5] Wang, Yaling [6] Shen, Yamei [7] Gong, Xun [8] Liu, Huan 0000-0003-3909-0931 (Corresponding author) [2] [9] Zhang, Shou-Zhou 0000-0001-9070-0593 (Corresponding author) [1]

Affiliations

  1. [1] Fairy Lake Botanical Garden
  2. [NORA names: China; Asia, East];
  3. [2] BGI Group (China)
  4. [NORA names: China; Asia, East];
  5. [3] Nanjing Forestry University
  6. [NORA names: China; Asia, East];
  7. [4] Fujian Agriculture and Forestry University
  8. [NORA names: China; Asia, East];
  9. [5] University of British Columbia
  10. [NORA names: Canada; America, North; OECD];

Abstract

Magnolia biondii Pamp. (Magnoliaceae, magnoliids) is a phylogenetically, economically, and medicinally important ornamental tree species widely grown and cultivated in the north-temperate regions of China. Determining the genome sequence of M. biondii would help resolve the phylogenetic uncertainty of magnoliids and improve the understanding of individual trait evolution within the Magnolia genus. We assembled a chromosome-level reference genome of M. biondii using ~67, ~175, and ~154 Gb of raw DNA sequences generated via Pacific Biosciences single-molecule real-time sequencing, 10X Genomics Chromium, and Hi-C scaffolding strategies, respectively. The final genome assembly was ~2.22 Gb, with a contig N50 value of 269.11 kb and a BUSCO complete gene percentage of 91.90%. Approximately 89.17% of the genome was organized into 19 chromosomes, resulting in a scaffold N50 of 92.86 Mb. The genome contained 47,547 protein-coding genes, accounting for 23.47% of the genome length, whereas 66.48% of the genome length consisted of repetitive elements. We confirmed a WGD event that occurred very close to the time of the split between the Magnoliales and Laurales. Functional enrichment of the Magnolia-specific and expanded gene families highlighted genes involved in the biosynthesis of secondary metabolites, plant-pathogen interactions, and responses to stimuli, which may improve the ecological fitness and biological adaptability of the lineage. Phylogenomic analyses revealed a sister relationship of magnoliids and Chloranthaceae, which are sister to a clade comprising monocots and eudicots. The genome sequence of M. biondii could lead to trait improvement, germplasm conservation, and evolutionary studies on the rapid radiation of early angiosperms.

Keywords

BUSCO, BUSCO complete gene percentage, China, Chloranthaceae, Laurales, Magnolia, Magnolia genus, Magnoliales, N50, N50 value, PAMPs, Pacific, Pacific Biosciences single-molecule real-time sequencing, WGD, WGD events, adaptation, analysis, angiosperms, assembly, biological adaptation, biosynthesis, biosynthesis of secondary metabolites, chromium, chromosome, chromosome-level reference genome, clade, conservation, contig N50 value, contigs, early angiosperms, ecological fitness, elements, enrichment, eudicots, events, evolution, evolutionary studies, family, fitness, functional enrichment, gene family, genes, genome, genome assembly, genome length, genome sequence, genus, germplasm, germplasm conservation, improvement, interaction, length, lineages, magnoliids, medicine, metabolites, monocots, north temperate regions, ornamental tree species, percentage, phylogenetic uncertainty, phylogenomic analysis, plant-pathogen interactions, protein-coding genes, rapid radiation, real-time sequencing, regions of China, relationship, repetitive elements, response, scaffold N50, scaffolding strategies, scaffolds, secondary metabolites, sequence, single-molecule real-time sequencing, species, stimuli, strategies, study, time, trait evolution, trait improvement, traits, tree species, uncertainty, values

Funders

  • National Natural Science Foundation of China
  • State Council of the People's Republic of China
  • Ministry of Science and Technology of the People's Republic of China

Data Provider: Digital Science