Article
Engineering, Biomedical
Meryem Altin Karagoz, O. Ufuk Nalbantoglu
Summary: A CNN approach based on k-mer representation was proposed for metagenomic fragment classification, using the Relative Abundance Index to represent DNA and a deep learning algorithm for classification. Comparison with existing spectral methods showed competitive performance across various metagenomic datasets, indicating the effectiveness of the proposed method.
BIOMEDICAL SIGNAL PROCESSING AND CONTROL
(2021)
Article
Genetics & Heredity
Timothy Chappell, Shlomo Geva, James M. Hogan, David Lovell, Andrew Trotman, Dimitri Perrin
Summary: We propose a novel approach for the Metagenomic Geolocation Challenge that utilizes random projection of sample reads. This approach directly uses k-mer composition to characterize samples, eliminating the computationally demanding step of aligning reads to microbial reference sequences. Our findings demonstrate that k-mer representations carry sufficient information to determine the origin of metagenomic samples and that this reference-free approach requires less computation compared to previous methods.
FRONTIERS IN GENETICS
(2022)
Review
Biochemistry & Molecular Biology
Qingzhen Hou, Fabrizio Pucci, Fengming Pan, Fuzhong Xue, Marianne Rooman, Qiang Feng
Summary: This article reviews the application of metagenomic data in protein structure prediction and discovery. It introduces widely used metagenomic databases and analyzes how metagenomic data has contributed to the improvement of structure prediction methods. The article also discusses the role of metagenomes in the discovery of enzymes, new CRISPR-Cas systems, and antibiotic resistance genes.
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL
(2022)
Article
Computer Science, Theory & Methods
Shi Dong, Yifan Sun, Nicolas Bohm Agostini, Elmira Karimi, Daniel Lowell, Jing Zhou, Jose Cano, Jose L. Abellan, David Kaeli
Summary: The article presents Spartan, a lightweight hardware/software framework that accelerates DNN training on GPUs by exploiting activation sparsity, reducing computation and improving efficiency. Spartan provides efficient tools such as a sparsity monitor, a sparse GEMM algorithm, and a compaction engine, achieving significant reductions in sparsity profiling overhead and speeding up training on compute-intensive layers such as convolutional layers.
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
(2021)
Article
Multidisciplinary Sciences
Matteo Ciciani, Michele Demozzi, Eleonora Pedrazzoli, Elisabetta Visentin, Laura Pezze, Lorenzo Federico Signorini, Aitor Blanco-Miguez, Moreno Zolfo, Francesco Asnicar, Antonio Casini, Anna Cereseto, Nicola Segata
Summary: By developing a computational pipeline, researchers can accurately predict and isolate Cas9 nucleases targeting specific sequences, including using mutated sequences as PAM. This approach will be instrumental in generating a repertoire of Cas9 nucleases responding to any PAM requirement.
NATURE COMMUNICATIONS
(2022)
Article
Biochemistry & Molecular Biology
Fabian Schoeenfeld, Markus Stabrin, Tanvir R. Shaikh, Thorsten Wagner, Stefan Raunser
Summary: This paper presents GPU ISAC, a newly developed algorithm that uses GPU-acceleration to analyze single particles in electron microscopy data. Compared to existing methods, GPU ISAC can process large data sets and generate high quality class averages on a single desktop machine equipped with affordable GPUs.
FRONTIERS IN MOLECULAR BIOSCIENCES
(2022)
Article
Biochemistry & Molecular Biology
Silvio Weging, Andreas Gogol-Doring, Ivo Grosse
Summary: kASA is a k-mer-based tool capable of efficiently identifying and profiling metagenomic DNA or protein sequences with high sensitivity and precision. Custom algorithms and data structures optimized for external memory storage enable a full-scale taxonomic analysis on various devices.
NUCLEIC ACIDS RESEARCH
(2021)
Article
Genetics & Heredity
David J. Burks, Vaidehi Pusadkar, Rajeev K. Azad
Summary: POSMM is a new Markov model-based classifier that reintroduces the high sensitivity associated with alignment-free taxonomic classifiers. It is built on top of a rapid Markov model-based classification algorithm and generates logistic regression models using the Python sklearn library. POSMM is a valuable accompaniment to other programs, as it features a dynamic database-free approach and is user-friendly and highly adaptable. Combining POSMM with ultrafast classifiers like Kraken2 can achieve higher overall accuracy in metagenomic sequence classification.
ENVIRONMENTAL MICROBIOME
(2023)
Article
Multidisciplinary Sciences
Fabio F. de Oliveira, Leonardo A. Dias, Marcelo A. C. Fernandes
Summary: This article introduces a parallel hardware design used in bioinformatics to accelerate sequence alignment. The architecture utilizes a systolic array structure, reduces complexity by pre-calculating and storing paths, and achieves high-speed data processing.
Article
Microbiology
Mingji Lu, Dominik Schneider, Rolf Daniel
Summary: This study explores novel and/or extremophilic lipolytic enzymes by analyzing microbial consortia in composts through function-driven and sequence-based metagenomic approaches. The results reveal the diversity and distribution of lipolytic genes in metagenomes of various habitats, driven by ecological factors.
FRONTIERS IN MICROBIOLOGY
(2022)
Article
Computer Science, Hardware & Architecture
Kisaru Liyanage, Hasindu Gamaarachchi, Roshan Ragel, Sri Parameswaran
Summary: This article introduces a novel heterogeneous computing system combining an Intel FPGA-based hardware accelerator and a CPU to accelerate the chaining step in DNA sequence analysis. The system achieves up to ~1.35x performance improvement and consumes ~27% less energy when handling large, realistic workloads. Compared to the software solution running on the CPU without SIMD intrinsics, the system performs ~1.9x faster while consuming ~38% less energy. Importantly, the accuracy of the output is not compromised for the gained speed-up.
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS
(2023)
Article
Biochemical Research Methods
Jim Shaw, Yun William Yu
Summary: This article introduces skani, a sequence comparison tool for metagenome-assembled genomes (MAGs) that overcomes the challenges of high-volume or low-quality data. skani outperforms FastANI in terms of accuracy and speed, being more than 20 times faster for fragmented and incomplete MAGs. It can query genomes against over 65,000 prokaryotic genomes in a matter of seconds and with only 6 GB of memory. skani unlocks higher-resolution insights for extensive and noisy metagenomic datasets.
Article
Multidisciplinary Sciences
Aldair E. Gongora, Kelsey L. Snapp, Emily Whiting, Patrick Riley, Kristofer G. Reyes, Elise F. Morgan, Keith A. Brown
Summary: Autonomous experimentation (AE) combines automation and machine learning to conduct experiments intelligently and rapidly in a sequential manner. This study explores whether imperfect data from simulation can accelerate AE, focusing on the mechanics of additively manufactured structures. The research shows that simulation data can be used to improve the efficiency of AE through transfer learning methods.
Article
Computer Science, Information Systems
Koustubh Phalak, Swaroop Ghosh
Summary: Quantum Machine Learning (QML) is a rapidly growing field that combines Quantum Computing (QC) and Machine Learning (ML). This paper proposes a shot optimization method for QML models, which can reduce the number of shots without significantly impacting the model performance. The method is tested on MNIST and FMNIST datasets for classification tasks, and also applied to ground state energy estimation of molecules.
Article
Multidisciplinary Sciences
John T. Lovell, Nolan B. Bentley, Gaurab Bhattarai, Jerry W. Jenkins, Avinash Sreedasyam, Yanina Alarcon, Clive Bock, Lori Beth Boston, Joseph Carlson, Kimberly Cervantes, Kristen Clermont, Sara Duke, Nick Krom, Keith Kubenka, Sujan Mamidi, Christopher P. Mattison, Maria J. Monteros, Cristina Pisani, Christopher Plott, Shanmugam Rajasekar, Hormat Shadgou Rhein, Charles Rohla, Mingzhou Song, Rolston St. Hilaire, Shengqiang Shu, Lenny Wells, Jenell Webber, Richard J. Heerema, Patricia E. Klein, Patrick Conner, Xinwang Wang, L. J. Grauke, Jane Grimwood, Jeremy Schmutz, Jennifer J. Randall
Summary: Researchers assembled diploid genomes of four outbred pecan genotypes, identified interspecific introgressions through comparative genomics analyses, and mapped QTLs associated with pest resistance. By leveraging pan-genome presence-absence data and a functional annotation database, candidate genes related to pest resistance were identified.
NATURE COMMUNICATIONS
(2021)
Article
Biotechnology & Applied Microbiology
Yizhe Zhang, Yupeng He, Guangyong Zheng, Chaochun Wei
Article
Multidisciplinary Sciences
Zhiqiang Hu, Hamish S. Scott, Guangrong Qin, Guangyong Zheng, Xixia Chu, Lu Xie, David L. Adelson, Bergithe E. Oftedal, Parvathy Venugopal, Milena Babic, Christopher N. Hahn, Bing Zhang, Xiaojing Wang, Nan Li, Chaochun Wei
SCIENTIFIC REPORTS
(2015)
Article
Biochemistry & Molecular Biology
Chen Sun, Zhiqiang Hu, Tianqing Zheng, Kuangchen Lu, Yue Zhao, Wensheng Wang, Jianxin Shi, Chunchao Wang, Jinyuan Lu, Dabing Zhang, Zhikang Li, Chaochun Wei
NUCLEIC ACIDS RESEARCH
(2017)
Article
Multidisciplinary Sciences
Xiaoyong Li, Zhiqiang Hu, Xuelin Yu, Chen Zhang, Binbin Ma, Lin He, Chaochun Wei, Ji Wu
SCIENTIFIC REPORTS
(2017)
Article
Parasitology
Bikash Ranjan Giri, Jiannan Ye, Yongjun Chen, Chaochun Wei, Guofeng Cheng
PARASITOLOGY RESEARCH
(2018)
Article
Multidisciplinary Sciences
Zhiqiang Hu, Wensheng Wang, Zhichao Wu, Chen Sun, Min Li, Jinyuan Lu, Binying Fu, Jianxin Shi, Jianlong Xu, Jue Ruan, Chaochun Wei, Zhikang Li
Article
Biology
Yuyang Qiao, Ben Jia, Zhiqiang Hu, Chen Sun, Yijin Xiang, Chaochun Wei
Article
Multidisciplinary Sciences
Wensheng Wang, Ramil Mauleon, Zhiqiang Hu, Dmytro Chebotarov, Shuaishuai Tai, Zhichao Wu, Min Li, Tianqing Zheng, Roven Rommel Fuentes, Fan Zhang, Locedie Mansueto, Dario Copetti, Millicent Sanciangco, Kevin Christian Palis, Jianlong Xu, Chen Sun, Binying Fu, Hongliang Zhang, Yongming Gao, Xiuqin Zhao, Fei Shen, Xiao Cui, Hong Yu, Zichao Li, Miaolin Chen, Jeffrey Detras, Yongli Zhou, Xinyuan Zhang, Yue Zhao, Dave Kudrna, Chunchao Wang, Rui Li, Ben Jia, Jinyuan Lu, Xianchang He, Zhaotong Dong, Jiabao Xu, Yanhong Li, Miao Wang, Jianxin Shi, Jing Li, Dabing Zhang, Seunghee Lee, Wushu Hu, Alexander Poliakov, Inna Dubchak, Victor Jun Ulat, Frances Nikki Borja, John Robert Mendoza, Jauhar Ali, Jing Li, Qiang Gao, Yongchao Niu, Zhen Yue, Ma. Elizabeth B. Naredo, Jayson Talag, Xueqiang Wang, Jinjie Li, Xiaodong Fang, Ye Yin, Jean-Christophe Glaszmann, Jianwei Zhang, Jiayang Li, Ruaraidh Sackville Hamilton, Rod A. Wing, Jue Ruan, Gengyun Zhang, Chaochun Wei, Nickolai Alexandrov, Kenneth L. McNally, Zhikang Li, Hei Leung
Article
Biochemical Research Methods
Wenmin Zhang, Ben Jia, Chaochun Wei
BMC BIOINFORMATICS
(2019)
Article
Biotechnology & Applied Microbiology
Fazhe Yan, Xuelin Yu, Zhongqu Duan, Jinyuan Lu, Ben Jia, Yuyang Qiao, Chen Sun, Chaochun Wei
Article
Biochemical Research Methods
Van-Kien Bui, Chaochun Wei
BMC BIOINFORMATICS
(2020)
Article
Biochemistry & Molecular Biology
Xiaorui Dong, Hongzhang Xue, Chaochun Wei
Summary: ivTerm is an R-shiny package that allows users to visualize functional analysis results, compare results across multiple experiments, create customized charts, and download these charts. It provides users with various basic and innovative chart types to display functional terms and involved genes.
JOURNAL OF CELLULAR BIOCHEMISTRY
(2021)
Article
Biochemistry & Molecular Biology
Fan Zhang, Hongzhang Xue, Xiaorui Dong, Min Li, Xiaoming Zheng, Zhikang Li, Jianlong Xu, Wensheng Wang, Chaochun Wei
Summary: This study introduced new steps for handling long-read data and constructed a high-quality rice pan-genome that is more comprehensive than the one based on short-read sequencing. The main components of novel sequences are repetitive sequences, and the pan-genome constructed from long-read data is more representative than the one constructed from short-read data.
Article
Biochemical Research Methods
Lu Zeng, Stephen M. Pederson, Danfeng Cao, Zhipeng Qu, Zhiqiang Hu, David L. Adelson, Chaochun Wei
JOURNAL OF COMPUTATIONAL BIOLOGY
(2018)
Article
Biotechnology & Applied Microbiology
Wenze Huang, Lillian Tsai, Yulong Li, Nan Hua, Chen Sun, Chaochun Wei