Draft genome of the lined seahorse, Hippocampus erectus

Journal

GIGASCIENCE
Volume 6, Issue 6

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/gigascience/gix030

Keywords

genome; assembly; annotation; Hippocampus erectus

Funding

  1. Youth Foundation of National High Technology Research and Development Program [2015AA020909]
  2. Outstanding Youth Foundation in Guangdong Province [S2013050014802]
  3. Special Fund for Agro-scientific Research in the Public Interest [201403008]
  4. National Natural Science Foundation of China [41576145]
  5. China National Natural Science Foundation [31370047]
  6. Shenzhen Special Program for Future Industrial Development [JSGG20141020113728803]
  7. Special Project on the Integration of Industry, Education and Research of Guangdong Province [2013B090800017]
  8. Shenzhen Science and Technology Program [SGLH20131010105856414, GJHZ20160229173052805]
  9. Shenzhen Dapeng Special Program for Industrial Development [KY20160307]

Background: The lined seahorse, Hippocampus erectus, is an Atlantic species that mainly inhabits shallow sea beds and coral reefs. It has become very popular in China because of its wide use in traditional Chinese medicine. To improve the aquaculture yield of this valuable fish species, we are developing genomic resources to assist selection in genetic breeding. Here, we report whole-genome sequencing, assembly, and gene annotation of the lined seahorse, which enrich the genomic resources available for its molecular breeding.

Findings: A total of 174.6 Gb (gigabases) of raw DNA sequence was generated on the Illumina HiSeq 2500 platform. The final assembly of the lined seahorse genome is approximately 458 Mb, representing 94% of the estimated genome size (489 Mb by k-mer analysis). The contig N50 and scaffold N50 reached 14.57 kb and 1.97 Mb, respectively. Assembly quality was assessed with BUSCO, which recovered 85% of the known vertebrate genes, and with de novo assembled RNA-seq transcripts, more than 99% of which could be mapped to the assembly. Using homology-based, de novo, and transcriptome-based prediction methods, we predicted 20 788 protein-coding genes in the assembly, fewer than the 23 458 genes we previously reported for the tiger tail seahorse (H. comes).

Conclusion: We report a draft genome of the lined seahorse. These genomic data enrich the genomic resources for this economically important fish and will also provide insights into the genetic mechanisms of its iconic morphology and male pregnancy behavior.
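For readers unfamiliar with the assembly statistics quoted in the findings, the following is a minimal sketch of how an N50 value and the assembled fraction of an estimated genome size are conventionally computed. It is not the authors' pipeline: the function names `n50` and `assembly_fraction` and the toy contig lengths are illustrative assumptions, and only the 458 Mb and 489 Mb figures come from the abstract.

```python
def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    together cover at least half of the total assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


def assembly_fraction(assembled_mb, estimated_mb):
    """Fraction of the estimated genome size covered by the assembly."""
    return assembled_mb / estimated_mb


# Hypothetical contig lengths for illustration only (not data from the paper).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)

# Figures reported in the abstract: 458 Mb assembled, 489 Mb estimated by k-mer analysis.
fraction = assembly_fraction(458, 489)

print(toy_n50)             # 70
print(f"{fraction:.1%}")   # 93.7%
```

The computed fraction of about 93.7% is consistent with the rounded 94% stated in the findings.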
