Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases
Published 2019 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases
Authors
Keywords
-
Journal
NUCLEIC ACIDS RESEARCH
Volume 47, Issue 21, Pages 10994-11006
Publisher
Oxford University Press (OUP)
Online
2019-10-02
DOI
10.1093/nar/gkz841
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- Errors in long-read assemblies can critically affect protein prediction
- (2019) Mick Watson et al. NATURE BIOTECHNOLOGY
- Genomic architecture of haddock (Melanogrammus aeglefinus) shows expansions of innate immune genes and short tandem repeats
- (2018) Ole K. Tørresen et al. BMC GENOMICS
- Comparative genome-wide characterization leading to simple sequence repeat marker development for Nicotiana
- (2018) Xuewen Wang et al. BMC GENOMICS
- Glutamine Codon Usage and polyQ Evolution in Primates Depend on the Q Stretch Length
- (2018) Pablo Mier et al. Genome Biology and Evolution
- Massive variation of short tandem repeats with functional consequences across strains ofArabidopsis thaliana
- (2018) Maximilian O. Press et al. GENOME RESEARCH
- Nuclear, chloroplast, and mitochondrial data of a US cannabis DNA database
- (2018) Rachel Houston et al. INTERNATIONAL JOURNAL OF LEGAL MEDICINE
- Population data and phylogenetic structure of Han population from Jiangsu province of China on GlobalFiler STR loci
- (2018) Atif Adnan et al. INTERNATIONAL JOURNAL OF LEGAL MEDICINE
- The repeat structure of two paralogous genes, Yersinia ruckeri invasin ( yrInv ) and a “ Y. ruckeri invasin-like molecule”, ( yrIlm ) sheds light on the evolution of adhesive capacities of a fish pathogen
- (2018) Agnieszka Wrobel et al. JOURNAL OF STRUCTURAL BIOLOGY
- The sea lamprey germline genome provides insights into programmed genome rearrangement and vertebrate evolution
- (2018) Jeramiah J. Smith et al. NATURE GENETICS
- Earth BioGenome Project: Sequencing life for the future of life
- (2018) Harris A. Lewin et al. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
- Bat Biology, Genomes, and the Bat1K Project: To Generate Chromosome-Level Genomes for All Living Bat Species
- (2018) Emma C. Teeling et al. Annual Review of Animal Biosciences
- Pushing the limits of de novo genome assembly for complex prokaryotic genomes harboring very long, near identical repeats
- (2018) Michael Schmid et al. NUCLEIC ACIDS RESEARCH
- Leucine Rich Repeat Proteins: Sequences, Mutations, Structures, and Diseases
- (2018) Norio Matsushima et al. PROTEIN AND PEPTIDE LETTERS
- Sequence-based diversity of 23 autosomal STR loci in Koreans investigated using an in-house massively parallel sequencing panel
- (2017) Eun Hye Kim et al. Forensic Science International-Genetics
- Evolution and Diversity of Transposable Elements in Vertebrate Genomes
- (2017) Cibele G. Sotero-Caio et al. Genome Biology and Evolution
- Combination of short-read, long-read, and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications
- (2017) Matthias H. Weissensteiner et al. GENOME RESEARCH
- De Novo Gene Evolution of Antifreeze Glycoproteins in Codfishes Revealed by Whole Genome Sequence Data
- (2017) Helle Tessand Baalsrud et al. MOLECULAR BIOLOGY AND EVOLUTION
- A study of the Bodrogköz population in north-eastern Hungary by Y chromosomal haplotypes and haplogroups
- (2017) Horolma Pamjav et al. MOLECULAR GENETICS AND GENOMICS
- Hidden genetic variation shapes the structure of functional elements in Drosophila
- (2017) Mahul Chakraborty et al. NATURE GENETICS
- GenBank
- (2017) Dennis A Benson et al. NUCLEIC ACIDS RESEARCH
- In silico characterization of tandem repeats in Trichophyton rubrum and related dermatophytes provides new insights into their role in pathogenesis
- (2017) Matheus Eloy Franco et al. Database-The Journal of Biological Databases and Curation
- Structure of a 1.5-MDa adhesin that binds its Antarctic bacterium to diatoms and ice
- (2017) Shuaiqi Guo et al. Science Advances
- Complete genome sequence and comparative genomics of the probiotic yeast Saccharomyces boulardii
- (2017) Indu Khatri et al. Scientific Reports
- Evolution of Hemoglobin Genes in Codfishes Influenced by Ocean Depth
- (2017) Helle Tessand Baalsrud et al. Scientific Reports
- Microsatellite landscape evolutionary dynamics across 450 million years of vertebrate genome evolution
- (2016) Richard H. Adams et al. GENOME
- Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown
- (2016) Mihaela Pertea et al. Nature Protocols
- RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures
- (2016) Lisanna Paladin et al. NUCLEIC ACIDS RESEARCH
- Structure and evolutionary history of a large family of NLR proteins in the zebrafish
- (2016) Kerstin Howe et al. Open Biology
- Microsatellite Length Scoring by Single Molecule Real Time Sequencing – Effects of Sequence Structure and PCR Regime
- (2016) Mikkel Meyn Liljegren et al. PLoS One
- TRAL: tandem repeat annotation library
- (2015) Elke Schaper et al. BIOINFORMATICS
- Comparative Analysis of Transposable Elements Highlights Mobilome Diversity and Evolution in Vertebrates
- (2015) Domitille Chalopin et al. Genome Biology and Evolution
- A global reference for human genetic variation
- (2015) Richard A. Gibbs et al. NATURE
- What's in a genome? The C-value enigma and the evolution of eukaryotic genome content
- (2015) Tyler A. Elliott et al. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES
- Annotation inconsistencies beyond sequence similarity-based function prediction – phylogeny and genome structure
- (2015) Vasilis J. Promponas et al. Standards in Genomic Sciences
- Current methods for automated annotation of protein-coding genes
- (2015) KJ Hoff et al. Current Opinion in Insect Science
- PacBio Sequencing and Its Applications
- (2015) Anthony Rhoads et al. GENOMICS PROTEOMICS & BIOINFORMATICS
- InterProScan 5: genome-scale protein function classification
- (2014) P. Jones et al. BIOINFORMATICS
- Genome-Wide Analysis of Simple Sequence Repeats in Marine Animals—a Comparative Approach
- (2014) Qun Jiang et al. MARINE BIOTECHNOLOGY
- Deep Conservation of Human Protein Tandem Repeats within the Eukaryotes
- (2014) Elke Schaper et al. MOLECULAR BIOLOGY AND EVOLUTION
- The evolution and function of protein tandem repeats in plants
- (2014) Elke Schaper et al. NEW PHYTOLOGIST
- UniProt: a hub for protein information
- (2014) NUCLEIC ACIDS RESEARCH
- PCR amplification of repetitive DNA: a limitation to genome editing technologies and many other applications
- (2014) Carl Maximilian Hommelsheim et al. Scientific Reports
- HRaP: database of occurrence of HomoRepeats and patterns in proteomes
- (2013) Mikhail Yu. Lobanov et al. NUCLEIC ACIDS RESEARCH
- Graph-based modeling of tandem repeats improves global multiple sequence alignment
- (2013) Adam M. Szalkowski et al. NUCLEIC ACIDS RESEARCH
- MAKER-P: A Tool Kit for the Rapid Creation, Management, and Quality Control of Plant Genome Annotations
- (2013) M. S. Campbell et al. PLANT PHYSIOLOGY
- Organization of lamprey variable lymphocyte receptor C locus and repertoire development
- (2013) S. Das et al. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
- Evolution of Hemoglobin and Its Genes
- (2013) R. C. Hardison Cold Spring Harbor Perspectives in Medicine
- Genome-Wide Analysis of Tandem Repeats in Plants and Green Algae
- (2013) Zhixin Zhao et al. G3-Genes Genomes Genetics
- Shining a Light on Dark Sequencing: Characterising Errors in Ion Torrent PGM Data
- (2013) Lauren M. Bragg et al. PLoS Computational Biology
- VLR-Based Adaptive Immunity
- (2012) Thomas Boehm et al. Annual Review of Immunology
- C-terminal low-complexity sequence repeats ofMycobacterium smegmatisKu modulate DNA binding
- (2012) Ambuj K. Kushwaha et al. BIOSCIENCE REPORTS
- Dissecting the role of low-complexity regions in the evolution of vertebrate proteins
- (2012) Núria Radó-Trilla et al. BMC EVOLUTIONARY BIOLOGY
- Protein genes in repetitive sequence—antifreeze glycoproteins in Atlantic cod genome
- (2012) Xuan Zhuang et al. BMC GENOMICS
- SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
- (2012) Anton Bankevich et al. JOURNAL OF COMPUTATIONAL BIOLOGY
- Whole-Genome Duplication and the Functional Diversification of Teleost Fish Hemoglobins
- (2012) Juan C. Opazo et al. MOLECULAR BIOLOGY AND EVOLUTION
- A beginner's guide to eukaryotic genome annotation
- (2012) Mark Yandell et al. NATURE REVIEWS GENETICS
- Repeat or not repeat?—Statistical validation of tandem repeat prediction in genomic sequences
- (2012) Elke Schaper et al. NUCLEIC ACIDS RESEARCH
- Direct Comparisons of Illumina vs. Roche 454 Sequencing Technologies on the Same Microbial Community DNA Sample
- (2012) Chengwei Luo et al. PLoS One
- Re-Evaluation of a Bacterial Antifreeze Protein as an Adhesin with Ice-Binding Activity
- (2012) Shuaiqi Guo et al. PLoS One
- MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects
- (2011) Carson Holt et al. BMC BIOINFORMATICS
- ALS51, a newly discovered gene in theCandida albicansALS family, created by intergenic recombination: analysis of the gene and protein, and implications for evolution of microbial gene families
- (2011) Xiaomin Zhao et al. FEMS IMMUNOLOGY AND MEDICAL MICROBIOLOGY
- Tandem repeats in proteins: From sequence to structure
- (2011) Andrey V. Kajava JOURNAL OF STRUCTURAL BIOLOGY
- Field guide to next-generation DNA sequencers
- (2011) TRAVIS C. GLENN Molecular Ecology Resources
- The genome sequence of Atlantic cod reveals a unique immune system
- (2011) Bastiaan Star et al. NATURE
- Full-length transcriptome assembly from RNA-Seq data without a reference genome
- (2011) Manfred G Grabherr et al. NATURE BIOTECHNOLOGY
- Repetitive DNA and next-generation sequencing: computational challenges and solutions
- (2011) Todd J. Treangen et al. NATURE REVIEWS GENETICS
- Reassessing Domain Architecture Evolution of Metazoan Proteins: Major Impact of Gene Prediction Errors
- (2011) Alinda Nagy et al. Genes
- Characteristics of 454 pyrosequencing data--enabling realistic simulation with flowsim
- (2010) S. Balzer et al. BIOINFORMATICS
- Genome-wide analysis of tandem repeats in Daphnia pulex - a comparative approach
- (2010) Christoph Mayer et al. BMC GENOMICS
- Protein tandem repeats - the more perfect, the less structured
- (2010) Julien Jorda et al. FEBS Journal
- The Next Generation of Molecular Markers From Massively Parallel Sequencing of Pooled DNA Samples
- (2010) Andreas Futschik et al. GENETICS
- Copy Number Variation Shapes Genome Diversity in Arabidopsis Over Immediate Family Generational Scales
- (2010) Seth DeBolt Genome Biology and Evolution
- Natural selection drives the accumulation of amino acid tandem repeats in human proteins
- (2010) L. Mularoni et al. GENOME RESEARCH
- Assembly algorithms for next-generation sequencing data
- (2010) Jason R. Miller et al. GENOMICS
- Centromere identity: a challenge to be faced
- (2010) Gunjan D. Mehta et al. MOLECULAR GENETICS AND GENOMICS
- Replication of individual DNA molecules under electronic control using a protein nanopore
- (2010) Felix Olasagasti et al. Nature Nanotechnology
- High-quality draft assemblies of mammalian genomes from massively parallel sequence data
- (2010) S. Gnerre et al. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
- Draft Genome Sequencing of Giardia intestinalis Assemblage B Isolate GS: Is Human Giardiasis Caused by Two Different Species?
- (2009) Oscar Franzén et al. PLoS Pathogens
- Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
- (2009) Michelle Simon et al. GENOME BIOLOGY
- Using native and syntenically mapped cDNA alignments to improve de novo gene finding
- (2008) Mario Stanke et al. BIOINFORMATICS
- Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
- (2008) D. R. Zerbino et al. GENOME RESEARCH
- Accurate whole human genome sequencing using reversible terminator chemistry
- (2008) David R. Bentley et al. NATURE
- Origin and Evolution of GALA-LRR, a New Member of the CC-LRR Subfamily: From Plants to Bacteria?
- (2008) Andrey V. Kajava et al. PLoS One
- Real-Time DNA Sequencing from Single Polymerase Molecules
- (2008) J. Eid et al. SCIENCE
- Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments
- (2008) Brian J Haas et al. GENOME BIOLOGY
Add your recorded webinar
Do you already have a recorded webinar? Grow your audience and get more views by easily listing your recording on Peeref.
Upload NowAsk a Question. Answer a Question.
Quickly pose questions to the entire community. Debate answers and get clarity on the most important issues facing researchers.
Get Started