4.7 Article

Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique

期刊

BIOINFORMATICS
卷 35, 期 12, 页码 2075-2083

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bty943

关键词

-

资金

  1. National Nature Scientific Foundation of China [61772119, 31771471]
  2. Natural Science Foundation for Distinguished Young Scholar of Hebei Province [C2017209244]
  3. Fundamental Research Funds for the Central Universities of China [ZYGX2015Z006, ZYGX2016J118, ZYGX2016J125, ZYGX2016J223]
  4. Science Strength Promotion Program of UESTC

向作者/读者索取更多资源

Motivation DNA replication is a key step to maintain the continuity of genetic information between parental generation and offspring. The initiation site of DNA replication, also called origin of replication (ORI), plays an extremely important role in the basic biochemical process. Thus, rapidly and effectively identifying the location of ORI in genome will provide key clues for genome analysis. Although biochemical experiments could provide detailed information for ORI, it requires high experimental cost and long experimental period. As good complements to experimental techniques, computational methods could overcome these disadvantages. Results Thus, in this study, we developed a predictor called iORI-PseKNC2.0 to identify ORIs in the Saccharomyces cerevisiae genome based on sequence information. The PseKNC including 90 physicochemical properties was proposed to formulate ORI and non-ORI samples. In order to improve the accuracy, a two-step feature selection was proposed to exclude redundant and noise information. As a result, the overall success rate of 88.53% was achieved in the 5-fold cross-validation test by using support vector machine. Availability and implementation Based on the proposed model, a user-friendly webserver was established and can be freely accessed at http://lin-group.cn/server/iORI-PseKNC2.0. The webserver will provide more convenience to most of wet-experimental scholars.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Genetics & Heredity

Computational Analysis Illustrates the Mechanism of Qingfei Paidu Decoction in Blocking the Transition of COVID-19 Patients from Mild to Severe Stage

Xianhai Li, Liu Xiang, Yue Lin, Qiang Tang, Fanbo Meng, Wei Chen

Summary: This study aims to explore the mechanism of Qingfei Paidu Decoction (QFPDD) in blocking the transition of COVID-19 patients from mild to severe stage. Through screening key ingredients and performing KEGG enrichment analysis, it was discovered that QFPDD can prevent the deterioration of COVID-19 by inhibiting relevant genes and multiple signaling pathways.

CURRENT GENE THERAPY (2022)

Article Biochemical Research Methods

A deep learning model to identify gene expression level using cobinding transcription factor signals

Lirong Zhang, Yanchao Yang, Lu Chai, Qianzhong Li, Junjie Liu, Hao Lin, Li Liu

Summary: This study quantitatively analyzed the correlation between transcription factors (TFs) and gene expression and identified TF modules associated with gene expression. A convolutional neural network model, TFCNN, was constructed to predict gene expression levels based on the enrichment characteristics of TFs. Results showed that TFCNN achieved high prediction performance and outperformed other models by better extracting combinatorial interactions among TFs.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

iRice-MS: An integrated XGBoost model for detecting multitype post-translational modification sites in rice

Hao Lv, Yang Zhang, Jia-Shu Wang, Shi-Shi Yuan, Zi-Jie Sun, Fu-Ying Dao, Zheng-Xing Guan, Hao Lin, Ke-Jun Deng

Summary: In this study, a comprehensive method called iRice-MS based on eXtreme Gradient Boosting (XGBoost) was developed to identify multiple post-translational modifications (PTMs) in rice. The method displayed excellent performance in cross-validation and independent dataset test, and showed superiority to existing tools in terms of AUC value.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

Deep-4mCGP: A Deep Learning Approach to Predict 4mC Sites in Geobacter pickeringii by Using Correlation-Based Feature Selection Technique

Hasan Zulfiqar, Qin-Lai Huang, Hao Lv, Zi-Jie Sun, Fu-Ying Dao, Hao Lin

Summary: The study aimed to establish a robust deep learning model to recognize 4mC sites in Geobacter pickeringii. By using different feature descriptors and optimization algorithms, the accuracy of the model was improved.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Article Biochemical Research Methods

The evolution of N6-methyladenosine regulators in plants

Meng Wu, Fulei Nie, Haibin Liu, Tianyang Zhang, Miaomiao Li, Xiaoming Song, Wei Chen

Summary: This study investigated 1592 m(6)A modification regulators from 65 representative plant species and analyzed their phylogenetic relationships, sequence structure, selection pressure, and codon usage. The study found that regulators from different species or subfamilies were distinguishable based on phylogenetic trees. The gene structure of regulators was conserved, but unique exon/intron structures and motif organizations were observed among different families. The analysis also demonstrated that regulators experienced purifying selection, and the selection pressure was more relaxed in higher plants, suggesting they might have acquired new functions during evolution. Additionally, different codon usage preferences were observed for different kinds of regulators.

METHODS (2022)

Article Biochemistry & Molecular Biology

RNAInter v4.0: RNA interactome repository with redefined confidence scoring system and improved accessibility

Juanjuan Kang, Qiang Tang, Jun He, Le Li, Nianling Yang, Shuiyan Yu, Mengyao Wang, Yuchen Zhang, Jiahao Lin, Tianyu Cui, Yongfei Hu, Puwen Tan, Jun Cheng, Hailong Zheng, Dong Wang, Xi Su, Wei Chen, Yan Huang

Summary: The study updated the RNAInter database to version 4.0, which includes an enlarged data set and an updated confidence scoring system, providing a faster and more user-friendly interface, with over 47 million total entries. This will offer a more comprehensive and readily accessible RNA interactome platform for investigating the regulatory landscape of cellular RNAs.

NUCLEIC ACIDS RESEARCH (2022)

Review Biochemical Research Methods

A comprehensive review of bioinformatics tools for chromatin loop calling

Li Liu, Kaiyuan Han, Huimin Sun, Lu Han, Dong Gao, Qilemuge Xi, Lirong Zhang, Hao Lin

Summary: This review provides an overview of loop-calling tools for various 3C-based techniques. It categorizes and summarizes these tools, discusses background biases and denoising algorithms, and helps researchers select the most appropriate method for loop calling and downstream analysis. It is also useful for bioinformatics scientists aiming to develop new loop-calling algorithms.

BRIEFINGS IN BIOINFORMATICS (2023)

Article Biochemical Research Methods

Single-cell RNA-seq data analysis based on directed graph neural network

Xiang Feng, Hongqi Zhang, Hao Lin, Haixia Long

Summary: In this study, a directed graph neural network called scDGAE was developed for scRNA-seq analysis, using graph autoencoders and graph attention network. The experiment results showed that the scDGAE model achieved promising performance in gene imputation and cell clustering prediction, and it can be applied to general scRNA-Seq analyses.

METHODS (2023)

Review Medicine, Research & Experimental

Artificial intelligence for drug discovery: Resources, methods, and applications

Wei Chen, Xuesong Liu, Sanyin Zhang, Shilin Chen

Summary: Traditional wet laboratory testing and validations are costly and time-consuming for drug discovery. However, advancements in artificial intelligence (AI) techniques are changing the landscape of drug discovery. Combined with accessible data resources, AI techniques have accelerated the drug discovery process and have been widely applied in pharmaceutical analysis and drug design. Nonetheless, there are challenges in applying AI to drug discovery.

MOLECULAR THERAPY-NUCLEIC ACIDS (2023)

Article Chemistry, Medicinal

iPADD: A Computational Tool for Predicting Potential Antidiabetic Drugs Using Machine Learning Algorithms

Xiao-Wei Liu, Tian-Yu Shi, Dong Gao, Cai-Yi Ma, Hao Lin, Dan Yan, Ke-Jun Deng

Summary: Diabetes mellitus is a chronic metabolic disease that disrupts blood glucose homeostasis and leads to severe complications. The development of artificial intelligence has provided a powerful tool, iPADD, for accelerating the discovery of potential antidiabetic drugs. iPADD achieved high accuracy in drug prediction by using molecular fingerprints and machine learning algorithms.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Medicine, General & Internal

A First Computational Frame for Recognizing Heparin-Binding Protein

Wen Zhu, Shi-Shi Yuan, Jian Li, Cheng-Bing Huang, Hao Lin, Bo Liao

Summary: This study provides the first recognition framework for accurately identifying HBP based on machine learning. By using four sequence descriptors, HBP and non-HBP samples were represented by discrete numbers and input into SVM and RF algorithms for comparison. The SVM-based classifier was found to have the greatest potential for identifying HBP.

DIAGNOSTICS (2023)

Review Biochemistry & Molecular Biology

Empirical comparison and recent advances of computational prediction of hormone binding proteins using machine learning methods

Hasan Zulfiqar, Zhiling Guo, Bakanina Kissanga Grace-Mercure, Zhao-Yue Zhang, Hui Gao, Hao Lin, Yun Wu

Summary: Hormone binding proteins (HBPs) belong to soluble carrier proteins that interact selectively and non-covalently with hormones, promoting growth hormone signaling in humans and other animals. The identification of HBPs is crucial for understanding these proteins and their applications in medical and commercial fields. Computational prediction methods, using sequence information and machine learning algorithms, have played a significant role in recognizing HBPs, offering a time-saving and cost-effective alternative to experimental methods.

COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL (2023)

Article Medicine, Research & Experimental

Unveiling the mechanisms of nephrotoxicity caused by nephrotoxic compounds using toxicological network analysis

Kexing Xi, Mengqing Zhang, Mingrui Li, Qiang Tang, Qi Zhao, Wei Chen

Summary: In this study, a network-based methodology was used to explore the mechanisms of nephrotoxicity induced by specific compounds. The results showed that the advanced glycosylation end products-receptor for advanced glycosylation end products signaling pathway, human cytomegalovirus infection, lipid and atheroapoptosis, and the phosphatidylinositol 3-kinase-Akt pathways play important roles in nephrotoxicity.

MOLECULAR THERAPY NUCLEIC ACIDS (2023)

Article Biodiversity Conservation

Urbanisation drives inter- and intraspecific variation in flight-related morphological traits of aquatic insects at different landscape scales

Wenfei Liao, Hao Lin

Summary: Urbanisation has complex effects on the morphological traits of aquatic insect species, with different species exhibiting different strategies and abilities to cope with movement barriers caused by urbanisation.

INSECT CONSERVATION AND DIVERSITY (2023)

Article Biology

Identification of Key DNA methylation sites related to differentially expressed genes in Lung squamous cell carcinoma

Jie Gao, Yongxian Feng, Yan Yang, Yuetong Shi, Junjie Liu, Hao Lin, Lirong Zhang

Summary: This study systematically identified and analyzed key CpG sites closely related to differential expression of genes in LUSC through a two-step correlation analysis method, and found that these sites and genes can serve as effective biomarkers for LUSC.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

暂无数据