4.7 Article Data Paper

Pseudomolecule-level assembly of the Chinese oil tree yellowhorn (Xanthoceras sorbifolium) genome

Journal

GIGASCIENCE
Volume 8, Issue 6, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/gigascience/giz070

Keywords

Xanthoceras sorbifolium; yellowhorn; PacBio sequencing; genome assembly; Hi-C; genome annotation; conserved chromosome

Funding

  1. Central Public-Interest Scientific Institution Basal Research Fund [CAFYBB2019QD001]
  2. National 12th Five-Year Plan for Science & Technology Support of China [2015BAD07B0106]
  3. National Natural Science Foundation of China [31800571, 31870594, 31760213]
  4. National Key Research and Development Plan of China [2016YFC050080506]
  5. Key Research Development Program of Liaoning Province [2017204001]

Ask authors/readers for more resources

Background: Yellowhorn (Xanthoceras sorbifolium) is a species of the Sapindaceae family native to China and is an oil tree that can withstand cold and drought conditions. A pseudomolecule-level genome assembly for this species will not only contribute to understanding the evolution of its genes and chromosomes but also bring yellowhorn breeding into the genomic era. Findings: Here, we generated 15 pseudomolecules of yellowhorn chromosomes, on which 97.04% of scaffolds were anchored, using the combined Illumina HiSeq, Pacific Biosciences Sequel, and Hi-C technologies. The length of the final yellowhorn genome assembly was 504.2 Mb with a contig N50 size of 1.04 Mb and a scaffold N50 size of 32.17 Mb. Genome annotation revealed that 68.67% of the yellowhorn genome was composed of repetitive elements. Gene modelling predicted 24,672 protein-coding genes. By comparing orthologous genes, the divergence time of yellowhorn and its close sister species longan (Dimocarpus longan) was estimated at similar to 33.07 million years ago. Gene cluster and chromosome synteny analysis demonstrated that the yellowhorn genome shared a conserved genome structure with its ancestor in some chromosomes. Conclusions: This genome assembly represents a high-quality reference genome for yellowhorn. Integrated genome annotations provide a valuable dataset for genetic and molecular research in this species. We did not detect whole-genome duplication in the genome. The yellowhorn genome carries syntenic blocks from ancient chromosomes. These data sources will enable this genome to serve as an initial platform for breeding better yellowhorn cultivars.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available