4.7 Article

W-AlignACE: an improved Gibbs sampling algorithm based on more accurate position weight matrices learned from sequence and gene expression/ChIP-chip data

期刊

BIOINFORMATICS
卷 24, 期 9, 页码 1121-1128

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btn088

关键词

-

资金

  1. NLM NIH HHS [LM008991-01, R01 LM008991-03, R01 LM008991] Funding Source: Medline

向作者/读者索取更多资源

Motivation: Position weight matrices (PWMs) are widely used to depict the DNA binding preferences of transcription factors (TFs) in computational molecular biology and regulatory genomics. Thus, learning an accurate PWM to characterize the binding sites of a specific TF is a fundamental problem that plays an important role in modeling regulatory motifs and also in discovering the regulatory targets of TFs. Results: We study the question of how to learn a more accurate PWM from both binding sequences and gene expression (or ChIP-chip) data, and propose to find a PWM such that the likelihood of simultaneously observing both binding sequences and their associated gene expression (or ChIP-chip) data is maximised. To solve the above maximum likelihood problem, a sequence weighting scheme is thus introduced based on the observation that binding sites inducing drastic fold changes in mRNA expression (or showing strong binding ratios in ChIP experiments) are likely to represent a true motif. We have incorporated this new learning approach into the popular motif finding program AlignACE. The modified program, called W-AlignACE, is compared with three other programs (AlignACE, MDscan and MotifRegressor) on a variety of datasets, including simulated data, mRNA expression and ChIP-chip data. These tests demonstrate that W-AlignACE is an effective tool for discovering TF binding motifs from gene expression (or ChIP-chip) data and, in particular, has the ability to find very weak motifs like DIG1 and GAL4. Availability: http://www.ntu.edu.sg/home/ChenXin/Gibbs Contact: chenxin@ntu.edu.sg Supplementary information: Supplementary data are available at Bioinformatics online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemical Research Methods

Quantifying functional impact of non-coding variants with multi-task Bayesian neural network

Chencheng Xu, Qiao Liu, Jianyu Zhou, Minzhu Xie, Jianxing Feng, Tao Jiang

BIOINFORMATICS (2020)

Article Multidisciplinary Sciences

Somatic SF3B1 hotspot mutation in prolactinomas

Chuzhong Li, Weiyan Xie, Jared S. Rosenblum, Jianyu Zhou, Jing Guo, Yazhou Miao, Yutao Shen, Hongyun Wang, Lei Gong, Mingxuan Li, Sida Zhao, Sen Cheng, Haibo Zhu, Tao Jiang, Shiying Ling, Fei Wang, Hongwei Zhang, Mingshan Zhang, Yanming Qu, Qi Zhang, Guilin Li, Junmei Wang, Jun Ma, Zhengping Zhuang, Yazhuo Zhang

NATURE COMMUNICATIONS (2020)

Article Biochemistry & Molecular Biology

MONN: A Multi-objective Neural Network for Predicting Compound-Protein Interactions and Affinities

Shuya Li, Fangping Wan, Hantao Shu, Tao Jiang, Dan Zhao, Jianyang Zeng

CELL SYSTEMS (2020)

Article Biochemical Research Methods

DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms

Dipan Shaw, Hao Chen, Minzhu Xie, Tao Jiang

Summary: The DeepLPI method uses a hybrid framework that integrates sequence, structure, and expression data to predict interactions between lncRNAs and protein isoforms. By combining multimodal deep learning neural network and conditional random field, along with using multiple instance learning approach, DeepLPI has improved prediction performance significantly. Further correlation analyses also demonstrated the effectiveness of co-expression information in predicting interactions.

BMC BIOINFORMATICS (2021)

Article Biochemical Research Methods

Riboexp: an interpretable reinforcement learning framework for ribosome density modeling

Hailin Hu, Xianggen Liu, An Xiao, YangYang Li, Chengdong Zhang, Tao Jiang, Dan Zhao, Sen Song, Jianyang Zeng

Summary: Riboexp is a novel deep reinforcement learning-based framework that successfully models the uneven distribution of ribosomes on mRNA during translation elongation, outperforming existing methods in predicting ribosome density. The application of Riboexp in codon optimization significantly increases protein production, while also providing meaningful biological insights through in-depth analyses.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Genetics & Heredity

Characterizing RNA Pseudouridylation by Convolutional Neural Networks

Xuan He, Sai Zhang, Yanqing Zhang, Zhixin Lei, Tao Jiang, Jianyang Zeng

Summary: The study introduced a model named PULSE based on convolutional neural network for analyzing large-scale Psi site data and characterizing the sequence features of pseudouridylation. The validation tests showed that PULSE outperformed other prediction methods, providing novel insights into the functional roles of pseudouridylation and enabling further research on the transcriptome-wide landscape of Psi sites.

GENOMICS PROTEOMICS & BIOINFORMATICS (2021)

Article Biochemistry & Molecular Biology

Modeling multi-species RNA modification through multi-task curriculum learning

Yuanpeng Xiong, Xuan He, Dan Zhao, Tingzhong Tian, Lixiang Hong, Tao Jiang, Jianyang Zeng

Summary: N-6-methyladenosine (m(6)A) is the most prevalent modification in eukaryotic mRNAs, regulating various biological processes. The computational framework MASS, based on multi-task curriculum learning, outperforms existing methods in capturing m(6)A features across multiple species. MASS also helps elucidate m(6)A similarities and differences among species and their relationships with gene regulation properties.

NUCLEIC ACIDS RESEARCH (2021)

Article Plant Sciences

Transcriptome Profiling of 'Candidatus Liberibacter asiaticus' in Citrus and Psyllids

Agustina De Francesco, Amelia H. Lovelace, Dipan Shaw, Min Qiu, Yuanchao Wang, Fatta Gurung, Veronica Ancona, Chunxia Wang, Amit Levy, Tao Jiang, Wenbo Ma

Summary: This study determined the expression profiles of 'Candidatus Liberibacter asiaticus' (Las) genes using a bacterial cell enrichment procedure. The results revealed highly expressed genes in citrus and differentially expressed genes between citrus and Asian citrus psyllids. The study provides insights into the biology of Huanglongbing and the interactions between Las, its plant host, and insect vector.

PHYTOPATHOLOGY (2022)

Article Biochemistry & Molecular Biology

A Global Analysis of Alternative Splicing of Dichocarpum Medicinal Plants, Ranunculales

Da-Cheng Hao, Hao Chen, Pei-Gen Xiao, Tao Jiang

Summary: In this study, the first global analysis of AS events in Dichocarpum was conducted using full-length transcriptome datasets of five Chinese endemic species. The research identified numerous AS events and successfully predicted the functions of AS isoforms.

CURRENT GENOMICS (2022)

Article Genetics & Heredity

FINER: enhancing the prediction of tissue-specific functions of isoforms by refining isoform interaction networks

Hao Chen, Dipan Shaw, Dongbo Bu, Tao Jiang

Summary: Annotating the functions of gene products is crucial in biology, with various databases established for recording functional knowledge at the gene level. There is a growing demand for functional annotations at the isoform resolution in many biological applications. Prediction of isoform functions and isoform-isoform interactions have been treated as separate computational problems, but they are actually intertwined and could benefit from each other.

NAR GENOMICS AND BIOINFORMATICS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

End-to-End Unpaired Image Denoising with Conditional Adversarial Networks

Zhiwei Hong, Xiaocheng Fan, Tao Jiang, Jianxing Feng

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE (2020)

Article Mathematical & Computational Biology

IRIS: A method for predicting in vivo RNA secondary structures using PARIS data

Jianyu Zhou, Pan Li, Wanwen Zeng, Wenxiu Ma, Zhipeng Lu, Rui Jiang, Qiangfeng Cliff Zhang, Tao Jiang

QUANTITATIVE BIOLOGY (2020)

Article Mathematical & Computational Biology

A simulated annealing approach for resolution guided homogeneous cryo-electron microscopy image selection

Jie Shi, Xiangrui Zeng, Rui Jiang, Tao Jiang, Min Xu

QUANTITATIVE BIOLOGY (2020)

Article Endocrinology & Metabolism

Dysregulation of Hypothalamic Gene Expression and the Oxytocinergic System by Soybean Oil Diets in Male Mice

Poonamjot Deol, Elena Kozlova, Matthew Valdez, Catherine Ho, Ei-Wen Yang, Holly Richardson, Gwendolyn Gonzalez, Edward Truong, Jack Reid, Joseph Valdez, Jonathan R. Deans, Jose Martinez-Lomeli, Jane R. Evans, Tao Jiang, Frances M. Sladek, Margarita C. Curras-Collazo

ENDOCRINOLOGY (2020)

Article Biochemical Research Methods

OMGS: Optical Map-Based Genome Scaffolding

Weihua Pan, Tao Jiang, Stefano Lonardi

JOURNAL OF COMPUTATIONAL BIOLOGY (2020)

暂无数据