4.7 Article

Kalign 3: multiple sequence alignment of large datasets

期刊

BIOINFORMATICS
卷 36, 期 6, 页码 1928-1929

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz795

关键词

-

资金

  1. Feilman Foundation

向作者/读者索取更多资源

Motivation: Kalign is an efficient multiple sequence alignment (MSA) program capable of aligning thousands of protein or nucleotide sequences. However, current alignment problems involving large numbers of sequences are exceeding Kalign's original design specifications. Here we present a completely re-written and updated version to meet current and future alignment challenges. Results: Kalign now uses a SIMD (single instruction, multiple data) accelerated version of the bit-parallel Gene Myers algorithm to estimate pairwise distances, adopts a sequence embedding strategy and the bi-secting K-means algorithm to rapidly construct guide trees for thousands of sequences. The new version maintains high alignment accuracy on both protein and nucleotide alignments and scales better than other MSA tools.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemical Research Methods

Bilateral murine tumor models for characterizing the response to immune checkpoint blockade

Rachael M. Zemek, Vanessa S. Fear, Cath Forbes, Emma de Jong, Thomas H. Casey, Louis Boon, Timo Lassmann, Anthony Bosco, Michael J. Millward, Anna K. Nowak, Richard A. Lake, W. Joost Lesterhuis

NATURE PROTOCOLS (2020)

Article Multidisciplinary Sciences

Reference exome data for Australian Aboriginal populations to support health-based research

Alexia L. Weeks, Heather A. D'Antoine, Melita McKinnon, Genevieve Syn, Dawn Bessarab, Ngiare Brown, Steven Y. C. Tong, Bo Remenyi, Andrew Steer, Lesley-Ann Gray, Michael Inouye, Jonathan R. Carapetis, Jenefer M. Blackwell, Timo Lassmann

SCIENTIFIC DATA (2020)

Article Biochemistry & Molecular Biology

Comparative transcriptomics of primary cells in vertebrates

Tanvir Alam, Saumya Agrawal, Jessica Severin, Robert S. Young, Robin Andersson, Erik Arner, Akira Hasegawa, Marina Lizio, Jordan A. Ramilowski, Imad Abugessaisa, Yuri Ishizu, Shohei Noma, Hiroshi Tarui, Martin S. Taylor, Timo Lassmann, Masayoshi Itoh, Takeya Kasukawa, Hideya Kawaji, Luigi Marchionni, Guojun Sheng, Alistair R. R. Forrest, Levon M. Khachigian, Yoshihide Hayashizaki, Piero Carninci, Michiel J. L. de Hoon

GENOME RESEARCH (2020)

Article Immunology

Rhinovirus Infection Drives Complex Host Airway Molecular Responses in Children With Cystic Fibrosis

Kak-Ming Ling, Luke W. Garratt, Erin E. Gill, Amy H. Y. Lee, Patricia Agudelo-Romero, Erika N. Sutanto, Thomas Iosifidis, Tim Rosenow, Stuart E. Turvey, Timo Lassmann, Robert E. W. Hancock, Anthony Kicic, Stephen M. Stick

FRONTIERS IN IMMUNOLOGY (2020)

Review Immunology

Characteristics of TCR Repertoire Associated With Successful Immune Checkpoint Therapy Responses

Joel Kidman, Nicola Principe, Mark Watson, Timo Lassmann, Robert A. Holt, Anna K. Nowak, Willem Joost Lesterhuis, Richard A. Lake, Jonathan Chee

FRONTIERS IN IMMUNOLOGY (2020)

Article Multidisciplinary Sciences

Reference exome data for a Northern Brazilian population

Alexia L. Weeks, Richard W. Francis, Joao I. C. F. Neri, Nathaly M. C. Costa, Nivea M. R. Arrais, Timo Lassmann, Jenefer M. Blackwell, Selma M. B. Jeronimo

SCIENTIFIC DATA (2020)

Article Immunology

Common and Rare Genetic Variants That Could Contribute to Severe Otitis Media in an Australian Aboriginal Population

Sarra E. Jamieson, Michaela Fakiola, Dave Tang, Elizabeth Scaman, Genevieve Syn, Richard W. Francis, Harvey L. Coates, Denise Anderson, Timo Lassmann, Heather J. Cordell, Jenefer M. Blackwell

Summary: This study identified plausible genetic risk loci for severe otitis media in Aboriginal Australian children, including NR3C1 and NREP genes. Analysis also showed associations with variants influencing protein coding and pathways related to abnormal hair cell morphology, ear issues, integrin interactions, and cell projection phenotypes.

CLINICAL INFECTIOUS DISEASES (2021)

Article Multidisciplinary Sciences

Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network

Mathys Grapotte, Manu Saraswat, Chloe Bessiere, Christophe Menichelli, Jordan A. Ramilowski, Jessica Severin, Yoshihide Hayashizaki, Masayoshi Itoh, Michihira Tagami, Mitsuyoshi Murata, Miki Kojima-Ishiyamas, Shohei Noma, Shuhei Noguchi, Takeya Kasukawa, Akira Hasegawa, Harukazu Suzuki, Hiromi Nishiyori-Sueki, Martin C. Frith, Clement Chatelain, Piero Carninci, Michiel J. L. de Hoom, Wyeth W. Wasserman, Laurent Brehelin, Charles-Henri Lecellieree

Summary: The FANTOM5 consortium used CAGE technology to map transcription start sites, finding that a large portion of non-coding transcription initiates at microsatellites. They developed Cap Trap RNA-seq to confirm this transcription, and trained deep learning models to predict CAGE signal at STRs with high accuracy. These models revealed the importance of STR surrounding sequences for distinguishing STR classes and predicting transcription initiation levels.

NATURE COMMUNICATIONS (2021)

Article Medicine, General & Internal

Searching for a technology-driven acute rheumatic fever test: the START study protocol

Anna P. Ralph, Rachel Webb, Nicole J. Moreland, Reuben McGregor, Anthony Bosco, David Broadhurst, Timo Lassmann, Timothy C. Barnett, Rym Benothman, Jennifer Yan, Bo Remenyi, Julie Bennett, Nigel Wilson, Mark Mayo, Glenn Pearson, Tobias Kollmann, Jonathan R. Carapetis

Summary: The absence of a diagnostic test for acute rheumatic fever (ARF) is a major impediment in managing this serious childhood condition, which is the precursor to rheumatic heart disease (RHD), a leading cause of health inequity and premature mortality for Indigenous peoples. The 'Searching for a Technology-Driven Acute Rheumatic Fever Test' (START) study aims to discover biomarkers that can distinguish ARF cases from non-ARF cases and potentially transform the approach to ARF diagnosis and RHD prevention.

BMJ OPEN (2021)

Article Cell & Tissue Engineering

CRISPR single base editing, neuronal disease modelling and functional genomics for genetic variant analysis: pipeline validation using Kleefstra syndrome EHMT1 haploinsufficiency

Vanessa S. Fear, Catherine A. Forbes, Denise Anderson, Sebastian Rauschert, Genevieve Syn, Nicole Shaw, Sarra Jamieson, Michelle Ward, Gareth Baynam, Timo Lassmann

Summary: This study proposes a rapid genetic variant assessment pipeline utilizing CRISPR technology and functional genomics to support rare disease diagnosis and identify key disease mechanisms. The results demonstrate the effectiveness of the pipeline in identifying specific genetic variants associated with Kleefstra syndrome and revealing potential transcription factors relevant to disease pathogenesis.

STEM CELL RESEARCH & THERAPY (2022)

Article Genetics & Heredity

Functional validation of variants of unknown significance using CRISPR gene editing and transcriptomics: A Kleefstra syndrome case study

Vanessa S. Fear, Catherine A. Forbes, Denise Anderson, Sebastian Rauschert, Genevieve Syn, Nicole Shaw, Matthew E. Jones, Alistair R. R. Forrest, Gareth Baynam, Timo Lassmann

Summary: This study investigates the use of CRISPR gene editing and genome-wide transcriptomic profiling to facilitate patient diagnosis. The introduction of a variant in the EHMT1 gene allowed the researchers to identify gene expression changes related to Kleefstra syndrome and gain insight into the disease mechanism.
Article Genetics & Heredity

An expanded phenotype centric benchmark of variant prioritisation tools

Denise Anderson, Timo Lassmann

Summary: Identifying causal variants for genetic disease diagnosis is challenging, and variant prioritization tools can assist in this task. Our study found that these tools show differences in performance across different disease contexts, and we recommend three standout tools.

HUMAN MUTATION (2022)

Article Multidisciplinary Sciences

Temporally restricted activation of IFNβ signaling underlies response to immune checkpoint therapy in mice

Rachael M. Zemek, Wee Loong Chin, Vanessa S. Fear, Ben Wylie, Thomas H. Casey, Cath Forbes, Caitlin M. Tilsed, Louis Boon, Belinda B. Guo, Anthony Bosco, Alistair R. R. Forrest, Michael J. Millward, Anna K. Nowak, Richard A. Lake, Timo Lassmann, W. Joost Lesterhuis

Summary: The biological determinants of the response to immune checkpoint blockade (ICB) in cancer are not fully understood. This study shows that responding tumors are associated with transient IFN-beta signaling, which can promote the anti-tumor response. The authors also identify inflammatory monocytes as the primary source of IFN beta and find that active type-I-IFN signaling in tumor-infiltrating inflammatory monocytes is associated with T cell expansion in patients treated with ICB.

NATURE COMMUNICATIONS (2022)

Article Cell Biology

CD4+T cells drive an inflammatory, TNF-a/IFN-rich tumor microenvironment responsive to chemotherapy

Caitlin M. Tilsed, Nicola Principe, Joel Kidman, Wee Loong Chin, M. Lizeth Orozco Morales, Rachael M. Zemek, Jonathan Chee, Rasa Islam, Vanessa S. Fear, Catherine Forbes, Wayne J. Aston, Maud Jansen, Abha Chopra, Timo Lassmann, Anna K. Nowak, Scott A. Fisher, Richard A. Lake, W. Joost Lesterhuis

Summary: This study characterizes the chemotherapy-responsive tumor microenvironment in mice through RNA sequencing analysis on tumors before and after chemotherapy. It reveals that responsive tumors have a highly immune infiltrated and inflammatory pre-treatment tumor microenvironment. The same gene expression profile is also associated with response to cyclophosphamide-based chemotherapy in breast cancer patients.

CELL REPORTS (2022)

Article Biochemical Research Methods

SAMStat 2: quality control for next generation sequencing data

Timo Lassmann

Summary: SAMStat is an efficient program for extracting quality control metrics from fastq and SAM/BAM files. It displays sequence composition, base quality composition, and mapping error profiles based on mapping quality, allowing users to quickly identify reasons for poor mapping. A major update to SAMStat now supports paired-end and long-read data, with quality control plots drawn using the ploty javascript library.

BIOINFORMATICS (2023)

暂无数据