Article, Data Paper

Genome assembly and transcriptome resource for river buffalo, Bubalus bubalis (2n=50)

Journal

GIGASCIENCE
Volume 6, Issue 10

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/gigascience/gix088

Keywords

Water buffalo; genome assembly; transcriptome; annotation

Funding

  1. Italian Buffalo Genome Consortium (Parco Tecnologico Padano)
  2. Italian Buffalo Genome Consortium (Universita degli Studi del Molise)
  3. Italian Buffalo Genome Consortium (CNR, Istituto di Biologia e Biotecnologia Agraria)
  4. Italian Buffalo Genome Consortium (CNR, Istituto Per Il Sistema Produzione Animale In Ambiente Mediterraneo, CISIA-VARIGEAV)
  5. Italian Buffalo Genome Consortium (Consorzio per la Sperimentazione, Divulgazione e Applicazione di Biotecniche Innovative)
  6. Italian Buffalo Genome Consortium (CRA Centro di Ricerca per la Produzione delle Carni ed il Miglioramento Genetico)
  7. Italian Buffalo Genome Consortium (Istituto Zooprofilattico Sperimentale del Mezzogiorno)
  8. Italian Buffalo Genome Consortium (Universita della Tuscia)
  9. Italian Buffalo Genome Consortium (Universita Cattolica del Sacro Cuore)
  10. Italian Buffalo Genome Consortium (Universita degli Studi di Napoli Federico II)
  11. Italian Buffalo Genome Consortium (Universita degli Studi di Sassari)
  12. Italian Buffalo Genome Consortium (Universita degli Studi di Udine)
  13. USDA-ARS [5438-31000-073-00D]
  14. GenHome from the Italian Ministry of Education
  15. NIH, National Library of Medicine

Water buffalo is a globally important species for agriculture and local economies. A de novo assembled, well-annotated reference sequence for the water buffalo is an important prerequisite for studying the biology of this species, and is necessary to manage genetic diversity and to use modern breeding and genomic selection techniques. However, no such genome assembly has been previously reported. There are 2 species of domestic water buffalo, the river (2n = 50) and the swamp (2n = 48) buffalo. Here we describe a draft quality reference sequence for the river buffalo created from Illumina GA and Roche 454 short read sequences using the MaSuRCA assembler. The assembled sequence is 2.83 Gb, consisting of 366 983 scaffolds with a scaffold N50 of 1.41 Mb and contig N50 of 21 398 bp. Annotation of the genome was supported by transcriptome data from 30 tissues and identified 21 711 predicted protein coding genes. Searches for complete mammalian BUSCO gene groups found 98.6% of curated single copy orthologs present among predicted genes, which suggests a high level of completeness of the genome. The annotated sequence is available from NCBI at accession GCA_000471725.1.
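
For readers unfamiliar with the scaffold and contig N50 figures quoted in the abstract, the following is a minimal sketch in Python. It assumes the conventional definition of N50: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembled length. The function name `n50` and the toy lengths are illustrative only and do not come from the paper or its pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (conventional definition)."""
    # Sort from longest to shortest so the largest sequences are counted first.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum reaches half the total is the N50.
        if running >= half_total:
            return length
    # Only reached when the input is empty.
    raise ValueError("lengths must be non-empty")


# Toy example with hypothetical scaffold lengths (not data from the assembly).
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
toy_n50 = n50(toy_lengths)
```

In the toy example the total length is 32, and the two longest sequences (8 + 8) already cover half of it, so the N50 is 8. Applied to a real assembly, the input would be the list of all scaffold or contig lengths.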
