4.7 Article

crabs-A software program to generate curated reference databases for metabarcoding sequencing data

Journal

MOLECULAR ECOLOGY RESOURCES
Volume 23, Issue 3, Pages 725-738

Publisher

WILEY
DOI: 10.1111/1755-0998.13741

Keywords

aDNA; ancient DNA; eDNA; environmental DNA; python; reference database curation; taxonomy assignment

Ask authors/readers for more resources

The measurement of biodiversity plays a vital role in life science research. However, the reliability and accuracy of taxonomic assignment in metabarcoding sequencing data greatly depend on the quality and completeness of reference databases. To address this issue, researchers have developed crabs, a software package that allows the creation of curated reference databases for metabarcoding studies.
The measurement of biodiversity is an integral aspect of life science research. With the establishment of second- and third-generation sequencing technologies, an increasing amount of metabarcoding data is being generated as we seek to describe the extent and patterns of biodiversity in multiple contexts. The reliability and accuracy of taxonomically assigning metabarcoding sequencing data have been shown to be critically influenced by the quality and completeness of reference databases. Custom, curated, eukaryotic reference databases, however, are scarce, as are the software programs for generating them. Here, we present crabs (Creating Reference databases for Amplicon-Based Sequencing), a software package to create custom reference databases for metabarcoding studies. crabs includes tools to download sequences from multiple online repositories (i.e., NCBI, BOLD, EMBL, MitoFish), retrieve amplicon regions through in silico PCR analysis and pairwise global alignments, curate the database through multiple filtering parameters (e.g., dereplication, sequence length, sequence quality, unresolved taxonomy, inclusion/exclusion filter), export the reference database in multiple formats for immediate use in taxonomy assignment software, and investigate the reference database through implemented visualizations for diversity, primer efficiency, reference sequence length, database completeness and taxonomic resolution. crabs is a versatile tool for generating curated reference databases of user-specified genetic markers to aid taxonomy assignment from metabarcoding sequencing data. crabs can be installed via docker and is available for download as a conda package and via GitHub ().

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemistry & Molecular Biology

Net overboard: Comparing marine eDNA sampling methodologies at sea to unravel marine biodiversity

Ulla von Ammon, Xavier Pochon, Paula Casanovas, Branwen Trochel, Martin Zirngibl, Austen Thomas, Jan Witting, Paul Joyce, Anastasija Zaiko

Summary: This study aimed to optimize environmental DNA (eDNA) sampling by comparing two different sampling methods and filters, and assessing their impact on biodiversity through metabarcoding analysis. The results showed that bucket sampling combined with self-preserving filters had the highest amplicon sequence variant (ASV) richness, while net sampling combined with 5μm pore size filters captured more metazoan taxa. These findings are important for optimizing eDNA sampling protocols in marine biodiversity research and surveillance.

MOLECULAR ECOLOGY RESOURCES (2023)

Article Biochemistry & Molecular Biology

Assessing the utility of marine filter feeders for environmental DNA (eDNA) biodiversity monitoring

Gert-Jan Jeunen, Jasmine S. Cane, Sara Ferreira, Francesca Strano, Ulla von Ammon, Hugh Cross, Robert Day, Sean Hesseltine, Kaleb Ellis, Lara Urban, Niall Pearson, Pamela Olmedo-Rojas, Anya Kardailsky, Neil J. Gemmell, Miles Lamare

Summary: Aquatic environmental DNA (eDNA) surveys are revolutionizing marine ecosystem monitoring, but the time-consuming active filtration step remains a bottleneck. This study investigated the similarities and differences between eDNA signals obtained from various sources, including water, filter-feeding organisms, and sponge material. The results showed that vertebrate eDNA signals from water and sponge samples were highly concordant, highlighting the potential of using marine sponges as an additional tool for eDNA-based biodiversity surveys. Caution should be taken to minimize the impact on marine communities during eDNA sampling.

MOLECULAR ECOLOGY RESOURCES (2023)

Article Fisheries

Genes involved in sex differentiation, epigenetic reprogramming, and cell fate regulate sex change in a wrasse

S. Muncaster, A. Goikoetxea, P. M. Lokman, C. E. Moraes, E. L. Damsteegt, J. Edgecombe, N. J. Gemmell, E. V. Todd

Summary: Socially induced sex change is regulated by a combination of genes and epigenetic factors that control sex differentiation and cell fate. The molecular basis for this transformation is still largely unknown. Recent research suggests that both epigenetic effects and genes involved in cell fate are important drivers of sex change.

REVIEWS IN FISH BIOLOGY AND FISHERIES (2023)

Article Fisheries

Environmental DNA metabarcoding describes biodiversity across marine gradients

Clare I. M. Adams, Gert-Jan Jeunen, Hugh Cross, Helen R. Taylor, Antoine Bagnaro, Kim Currie, Chris Hepburn, Neil J. Gemmell, Lara Urban, Federico Baltar, Michael Stat, Michael Bunce, Michael Knapp

Summary: In response to climate change, efficient monitoring methods are needed for rapidly shifting biodiversity patterns in the oceans. Environmental DNA (eDNA) metabarcoding has emerged as a cost-effective solution. Using eDNA, we detected four community types across a transect in the Southern Hemisphere and found that diversity patterns were mainly driven by planktonic organisms. This technique lays the foundations for multi-trophic environmental monitoring efforts.

ICES JOURNAL OF MARINE SCIENCE (2023)

Article Biochemistry & Molecular Biology

Formalin-fixed paraffin-embedded (FFPE) samples help to investigate transcriptomic responses in wildlife disease

Allison K. Miller, Cara L. Brosnahan, Anjali Pande, Cindy F. Baker, Jemma L. Geoghegan, Jane Kitson, Neil J. Gemmell, Edwina J. Dowle

Summary: Infectious diseases have a significant impact on various organisms, and understanding the interactions between hosts and pathogens is crucial for their conservation and management. The use of genomic approaches has made it easier to obtain this knowledge quickly, however, many species still face challenges in accessing appropriate samples and data. Archival materials, such as formalin-fixed paraffin-embedded tissue samples, may provide a valuable resource for studying pathogen emergence and host responses over long periods of time.

MOLECULAR ECOLOGY RESOURCES (2023)

Article Zoology

Neuroanatomy of a sex changing fish: the New Zealand spotty wrasse (Notolabrus celidotus) brain atlas

Kaj Kamstra, Chloe van der Burg, Haylee M. Quertermous, Simon Muncaster, Erica V. Todd, Christine L. Jasoni, Culum Brown, Neil J. Gemmell

Summary: For most vertebrates, sexual fate is genetically determined and remains fixed throughout life. However, for some teleost fishes sex is more plastic. Significant progress has been made in characterising the cellular and molecular processes that underpin gonadal sex change. The brain-mediated mechanisms that underlie and initiate this transformation, however, remain poorly understood.

NEW ZEALAND JOURNAL OF ZOOLOGY (2023)

Article Biochemistry & Molecular Biology

Bisulfite probing reveals DNA structural intricacies

Andrew T. M. Bagshaw, Neil J. Gemmell

Summary: In recent years, scientists have shifted their focus from studying the relationships between adjacent nucleotides to exploring the larger scale structure of DNA. A little-known technique called non-denaturing bisulfite modification of genomic DNA in conjunction with high-throughput sequencing has provided valuable insights. This technique has revealed a gradient in reactivity that increases towards the 5' end of poly-dC:dG mononucleotide repeats, suggesting the presence of positive-roll bending not predicted by existing models. Furthermore, these repeats are enriched at positions relative to the nucleosome dyad that bend towards the major groove, providing important information about DNA packaging.

NUCLEIC ACIDS RESEARCH (2023)

Article Cell Biology

An improved germline genome assembly for the sea lamprey Petromyzon marinus illuminates the evolution of germline-specific chromosomes

Nataliya Timoshevskaya, Kaan Eskut, Vladimir A. Timoshevskiy, Sofia M. C. Robb, Carson Holt, Jon E. Hess, Hugo J. Parker, Cindy F. Baker, Allison K. Miller, Cody Saraceno, Mark Yandell, Robb Krumlauf, Shawn R. Narum, Ralph T. Lampman, Neil J. Gemmell, Jacquelyn Mountcastle, Bettina Haase, Jennifer R. Balacco, Giulio Formenti, Sarah Pelan, Ying Sims, Kerstin Howe, Olivier Fedrigo, Erich D. Jarvis, Jeramiah J. Smith

Summary: Programmed DNA loss is a gene silencing mechanism found in various vertebrate and nonvertebrate lineages. The evolution of somatically eliminated sequences in these species has been difficult to reconstruct due to repetitive and duplicated sequences. However, an improved assembly of the sea lamprey genome has enabled analysis that sheds light on the recruitment of genes to the germline-specific fraction and reveals the roles of segmental duplication and positive selection in the long-term evolution of germline-specific chromosomes.

CELL REPORTS (2023)

Article Fisheries

Characterizing Antarctic fish assemblages using eDNA obtained from marine sponge bycatch specimens

Gert-Jan Jeunen, Miles Lamare, Jennifer Devine, Stefano Mariani, Sadie Mills, Jackson Treece, Sara Ferreira, Neil J. Gemmell

Summary: Given the challenges of monitoring the Southern Ocean through visual observations, this study explores the potential of marine sponge eDNA monitoring to assess the fish community in the region. The findings show that eDNA provides a more comprehensive view of the fish community compared to catch records, highlighting its potential for improving our understanding of this understudied ecosystem and aiding conservation efforts.

REVIEWS IN FISH BIOLOGY AND FISHERIES (2023)

Meeting Abstract Zoology

Comparison of Sphenodon punctatus and Tiliqua rugosa genomes reveals genomic basis of loss of.d T cells in Squamates

K. A. Morrissey, J. Samson, M. Rivera, L. Bu, V. L. Hansen, N. J. Gemmell, M. G. Gardner, T. Bertozzi, R. D. Miller

INTEGRATIVE AND COMPARATIVE BIOLOGY (2023)

Article Biochemistry & Molecular Biology

A framework for identifying fertility gene targets for mammalian pest control

Anna C. Clark, Rey Edison, Kevin Esvelt, Sebastian Kamau, Ludovic Dutoit, Jackson Champer, Samuel E. Champer, Philipp W. Messer, Alana Alexander, Neil J. Gemmell

Summary: This manuscript introduces a framework for identifying and evaluating target genes based on biological gene function, gene expression, and results from mouse knockout models. The framework identifies 16 genes essential for male fertility and 12 genes important for female fertility that may be feasible targets for mammalian gene drives and other genetic pest control technologies.

MOLECULAR ECOLOGY RESOURCES (2023)

No Data Available