4.5 Article

A New Database (GCD) on Genome Composition for Eukaryote and Prokaryote Genome Sequences and Their Initial Analyses

Journal

GENOME BIOLOGY AND EVOLUTION
Volume 4, Issue 4, Pages 501-512

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/gbe/evs026

Keywords

GCD; oligonucleotide frequency; alignment-free sequence comparison

Funding

  1. Ministry of Education, Culture, Sports, Science and Technology, Japan
  2. Human Genome Network

Ask authors/readers for more resources

Eukaryote genomes contain many noncoding regions, and they are quite complex. To understand these complexities, we constructed a database, Genome Composition Database, for the whole genome composition statistics for 101 eukaryote genome data, as well as more than 1,000 prokaryote genomes. Frequencies of all possible one to ten oligonucleotides were counted for each genome, and these observed values were compared with expected values computed under observed oligonucleotide frequencies of length 1-4. Deviations from expected values were much larger for eukaryotes than prokaryotes, except for fungal genomes. Mammalian genomes showed the largest deviation among animals. The results of comparison are available online at http://esper.lab.nig.ac.jp/genome-composition-database/.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available