4.7 Article

ODGI: understanding pangenome graphs

期刊

BIOINFORMATICS
卷 38, 期 13, 页码 3319-3326

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac308

关键词

-

资金

  1. National Institutes of Health/NIDA [U01DA047638]
  2. National Institutes of Health/NIGMS [R01GM123489]
  3. NSF PPoSS Award [2118709]
  4. Central Innovation Programme (ZIM) for SMEs of the Federal Ministry for Economic Affairs and Energy of Germany
  5. Germany's Excellence Strategy (CMFI) [EXC2124, (iFIT)-EXC 2180-390900677]
  6. BMBF [031A537B, 031A533A, 031A538A, 031A533B, 031A535A, 031A537C, 031A534A, 031A532B]

向作者/读者索取更多资源

Pangenome graphs provide a complete representation of genomic diversity, but analyzing large-scale genome data using existing tools is challenging. Optimized Dynamic Genome/Graph Implementation (ODGI) is a new tool suite with efficient in-memory representation and support for various operations and visualization. Its parallel execution helps answer complex biological questions quickly.
Motivation: Pangenome graphs provide a complete representation of the mutual alignment of collections of genomes. These models offer the opportunity to study the entire genomic diversity of a population, including structurally complex regions. Nevertheless, analyzing hundreds of gigabase-scale genomes using pangenome graphs is difficult as it is not well-supported by existing tools. Hence, fast and versatile software is required to ask advanced questions to such data in an efficient way. Results: We wrote Optimized Dynamic Genome/Graph Implementation (ODGI), a novel suite of tools that implements scalable algorithms and has an efficient in-memory representation of DNA pangenome graphs in the form of variation graphs. ODGI supports pre-built graphs in the Graphical Fragment Assembly format. ODGI includes tools for detecting complex regions, extracting pangenomic loci, removing artifacts, exploratory analysis, manipulation, validation and visualization. Its fast parallel execution facilitates routine pangenomic tasks, as well as pipelines that can quickly answer complex biological questions of gigabase-scale pangenome graphs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemistry & Molecular Biology

Control of replication stress and mitosis in colorectal cancer stem cells through the interplay of PARP1, MRE11 and RAD51

Gwenola Manic, Martina Musella, Francesca Corradi, Antonella Sistigu, Sara Vitale, Sara Soliman Abdel Rehim, Luca Mattiello, Eva Malacaria, Claudia Galassi, Michele Signore, Matteo Pallocca, Stefano Scalera, Frauke Goeman, Francesca De Nicola, Andrea Guarracino, Rosa Pennisi, Fabrizio Antonangeli, Francesca Sperati, Marta Baiocchi, Mauro Biffoni, Maurizio Fanciulli, Marcello Maugeri-Sacca, Annapaola Franchitto, Pietro Pichierri, Ruggero De Maria, Ilio Vitale

Summary: This passage discusses the crucial role of cancer stem cells in colorectal cancer and the research findings on PARP1 and MRE11/RAD51, providing new insights for tumor treatment.

CELL DEATH AND DIFFERENTIATION (2021)

Article Cell Biology

Inhibition of HECT E3 ligases as potential therapy for COVID-19

Giuseppe Novelli, Jing Liu, Michela Biancolella, Tonino Alonzi, Antonio Novelli, J. J. Patten, Dario Cocciadiferro, Emanuele Agolini, Vito Luigi Colona, Barbara Rizzacasa, Rosalinda Giannini, Benedetta Bigio, Delia Goletti, Maria Rosaria Capobianchi, Sandro Grelli, Justin Mann, Trevor D. McKee, Ke Cheng, Fatima Amanat, Florian Krammer, Andrea Guarracino, Gerardo Pepe, Carlo Tomino, Yacine Tandjaoui-Lambiotte, Yurdagul Uzunhan, Sarah Tubiana, Jade Ghosn, Luigi D. Notarangelo, Helen C. Su, Laurent Abel, Aurelie Cobat, Gai Elhanan, Joseph J. Grzymski, Andrea Latini, Sachdev S. Sidhu, Suresh Jain, Robert A. Davey, Jean-Laurent Casanova, Wenyi Wei, Pier Paolo Pandolfi

Summary: HECT family E3 ligases are identified as potential biomarkers for COVID-19, with rare germline activating variants in NEDD4 and WWP1 genes associated with severe cases. Natural inhibitors I3C from Brassicaceae show potent antiviral effects against SARS-CoV-2, making them potential targets for therapeutic strategies easily testable in clinical trials.

CELL DEATH & DISEASE (2021)

Article Biochemistry & Molecular Biology

BRIO: a web server for RNA sequence and structure motif scan

Andrea Guarracino, Gerardo Pepe, Francesco Ballesio, Marta Adinolfi, Marco Pietrosanto, Elisa Sangiovanni, Ilio Vitale, Gabriele Ausiello, Manuela Helmer-Citterich

Summary: The interaction between RNA and RNA-binding proteins is crucial in gene expression regulation and other biological processes. BRIO is a new web server designed to identify sequence and structure RNA-binding motifs in RNA molecules, providing access to over 2508 sequence motifs and 2296 secondary structure motifs identified in humans and mice.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

Evaluation of potential miRNA sponge effects of SARS genomes in human

G. Pepe, A. Guarracino, F. Ballesio, L. Parca, G. Ausiello, M. Helmer-Citterich

Summary: This study explores the potential interactions between human miRNAs and coronaviruses. The findings suggest that certain miRNA binding sites may be associated with deep vein thrombosis and cardiovascular diseases, particularly in SARS-CoV-2 and other pathogenic viruses.

NON-CODING RNA RESEARCH (2022)

Article Oncology

Multiomics surface receptor profiling of the NCI-60 tumor cell panel uncovers novel theranostics for cancer immunotherapy

Simon Heumos, Sandra Dehn, Konstantin Braeutigam, Marius C. Codrea, Christian M. Schuerch, Ulrich M. Lauer, Sven Nahnsen, Michael Schindler

Summary: This study identified specific cell surface receptor biomarkers for colon and renal cancer through analyzing the surface receptors of tumor cells, with CD106 (VCAM1) identified as a promising novel immunotherapeutic target for renal cancer treatment.

CANCER CELL INTERNATIONAL (2022)

Article Immunology

Type I IFNs promote cancer cell stemness by triggering the epigenetic regulator KDM1B

Martina Musella, Andrea Guarracino, Nicoletta Manduca, Claudia Galassi, Eliana Ruggiero, Alessia Potenza, Ester Maccafeo, Gwenola Manic, Luca Mattiello, Sara Soliman Abdel Rehim, Michele Signore, Marco Pietrosanto, Manuela Helmer-Citterich, Matteo Pallocca, Maurizio Fanciulli, Tiziana Bruno, Francesca De Nicola, Giacomo Corleone, Anna Di Benedetto, Cristiana Ercolani, Edoardo Pescarmona, Laura Pizzuti, Francesco Guidi, Francesca Sperati, Sara Vitale, Daniele Macchia, Massimo Spada, Giovanna Schiavoni, Fabrizio Mattei, Adele De Ninno, Luca Businaro, Valeria Lucarini, Laura Bracci, Eleonora Arico, Giovanna Ziccheddu, Francesco Facchiano, Stefania Rossi, Massimo Sanchez, Alessandra Boe, Mauro Biffoni, Ruggero De Maria, Ilio Vitale, Antonella Sistigu

Summary: The study demonstrates that type I interferons can promote the formation of cancer stem cells by upregulating the chromatin remodeling factor KDM1B. Inhibition of KDM1B could potentially prevent stem cell expansion and increase the long-term benefit of therapy.

NATURE IMMUNOLOGY (2022)

Article Biotechnology & Applied Microbiology

Pangenome graph construction from genome alignments with Minigraph-Cactus

Glenn Hickey, Jean Monlong, Jana Ebler, Adam M. Novak, Jordan M. Eizenga, Yan Gao, Tobias Marschall, Heng Li, Benedict Paten, Haley J. Abel, Lucinda L. Antonacci-Fulton, Mobin Asri, Gunjan Baid, Carl A. Baker, Anastasiya Belyaeva, Konstantinos Billis, Guillaume Bourque, Silvia Buonaiuto, Andrew Carroll, Mark J. P. Chaisson, Pi-Chuan Chang, Xian H. Chang, Haoyu Cheng, Justin Chu, Sarah Cody, Vincenza Colonna, Daniel E. Cook, Robert M. Cook-Deegan, Omar E. Cornejo, Mark Diekhans, Daniel Doerr, Peter Ebert, Jana Ebler, Evan E. Eichler, Jordan M. Eizenga, Susan Fairley, Olivier Fedrigo, Adam L. Felsenfeld, Xiaowen Feng, Christian Fischer, Paul Flicek, Giulio Formenti, Adam Frankish, Robert S. Fulton, Yan Gao, Shilpa Garg, Erik Garrison, Nanibaa' A. Garrison, Carlos Garcia Giron, Richard E. Green, Cristian Groza, Andrea Guarracino, Leanne Haggerty, Ira M. Hall, William T. Harvey, Marina Haukness, David Haussler, Simon Heumos, Glenn Hickey, Kendra Hoekzema, Thibaut Hourlier, Kerstin Howe, Miten Jain, Erich D. Jarvis, Hanlee P. Ji, Eimear E. Kenny, Barbara A. Koenig, Alexey Kolesnikov, Jan O. Korbel, Jennifer Kordosky, Sergey Koren, HoJoon Lee, Alexandra P. Lewis, Wen-Wei Liao, Shuangjia Lu, Tsung-Yu Lu, Julian K. Lucas, Magalhaes Hugo, Marco-Sola Santiago, Pierre Marijon, Charles Markello, Tobias Marschall, Fergal J. Martin, Ann McCartney, Jennifer McDaniel, Karen H. Miga, Matthew W. Mitchell, Jean Monlong, Jacquelyn Mountcastle, Katherine M. Munson, Moses Njagi Mwaniki, Maria Nattestad, Adam M. Novak, Sergey Nurk, Hugh E. Olsen, Nathan D. Olson, Trevor Pesout, Adam M. Phillippy, Alice B. Popejoy, David Porubsky, Pjotr Prins, Daniela Puiu, Mikko Rautiainen, Allison A. Regier, Arang Rhie, Samuel Sacco, Ashley D. Sanders, Valerie A. Schneider, Baergen Schultz, Kishwar Shafin, Jonas A. Sibbesen, Jouni Siren, Michael W. Smith, Heidi J. Sofia, Ahmad N. Abou Tayoun, Francoise Thibaud-Nissen, Chad Tomlinson, Francesca Floriana Tricomi, Flavia Villani, Mitchell R. Vollger, Justin Wagner, Brian Walenz, Ting Wang, Jonathan M. D. Wood, Aleksey Zimin, Justin M. Zook

Summary: Genome assemblies are used to directly construct genome graphs, which can represent various forms of genetic variation and improve analysis accuracy by overcoming single-reference bias.

NATURE BIOTECHNOLOGY (2023)

Article Biochemical Research Methods

Optimal gap-affine alignment in O(s) space

Santiago Marco-Sola, Jordan M. Eizenga, Andrea Guarracino, Benedict Paten, Erik Garrison, Miquel Moreto

Summary: Pairwise sequence alignment is a fundamental problem in computational biology and bioinformatics. Recent advances in genomics and sequencing technologies call for faster and scalable algorithms to handle longer sequences. The wavefront alignment algorithm (WFA) proposed an efficient approach to perform exact gap-affine alignment, but its memory requirements were computationally impractical for genome-scale alignments. In this study, the bidirectional WFA algorithm is introduced, which can compute optimal alignments with significantly reduced memory usage while maintaining the time complexity of WFA.

BIOINFORMATICS (2023)

Article Biotechnology & Applied Microbiology

A super-pangenome of the North American wild grape species

Noe Cochetel, Andrea Minio, Andrea Guarracino, Jadran F. Garcia, Rosa Figueroa-Balderas, Melanie Massonnet, Takao Kasuga, Jason P. Londo, Erik Garrison, Brandon S. Gaut, Dario Cantu

Summary: This study reports the sequencing, assembly, and annotation of wild North American grape genomes, and demonstrates the utility of a reference-unbiased super-pangenome in mapping short reads and identifying loci associated with salt tolerance. It highlights the importance of capturing the genetic diversity of wild relatives for crop improvement research.

GENOME BIOLOGY (2023)

Article Biochemical Research Methods

Unbiased pangenome graphs

Erik Garrison, Andrea Guarracino

Summary: This study presents the seqwish algorithm, which can build a variation graph from a set of sequences and their alignments. By transforming the alignment set into a tree-based representation and querying this representation, the algorithm constructs a variation graph, resulting in a pangenome variation graph. The method is scalable and has been successfully applied to build pangenome graphs for multiple species.

BIOINFORMATICS (2023)

Article Genetics & Heredity

nf-core/mag: a best-practice pipeline for metagenome hybrid assembly and binning

Sabrina Krakau, Daniel Straub, Hadrien Gourle, Gisela Gabernet, Sven Nahnsen

Summary: The study presents the nf-core/mag pipeline for metagenome assembly, binning, and taxonomic classification. The pipeline is characterized by its ease of installation, portability, and reproducibility, and it utilizes sample-wise group information to enhance assembly continuity.

NAR GENOMICS AND BIOINFORMATICS (2022)

Article Genetics & Heredity

Relative Information Gain: Shannon entropy-based measure of the relative structural conservation in RNA alignments

Marco Pietrosanto, Marta Adinolfi, Andrea Guarracino, Fabrizio Ferre, Gabriele Ausiello, Ilio Vitale, Manuela Helmer-Citterich

Summary: The study introduces a method for modeling the structural features of RNA, expanding alignment sets and introducing a new scoring metric to encode RNA structural information at different granularities. These approaches can be used in conjunction with other tools to gain insights into the structural elements of RNA.

NAR GENOMICS AND BIOINFORMATICS (2021)

暂无数据