4.5 Article

Null space based feature selection method for gene expression data

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s13042-011-0061-9

关键词

Feature selection; Null space; DNA microarray gene expression data; Classification accuracy; Biological significance

资金

  1. Grants-in-Aid for Scientific Research [10F00364] Funding Source: KAKEN

向作者/读者索取更多资源

Feature selection is quite an important process in gene expression data analysis. Feature selection methods discard unimportant genes from several thousands of genes for finding important genes or pathways for the target biological phenomenon like cancer. The obtained gene subset is used for statistical analysis for prediction such as survival as well as functional analysis for understanding biological characteristics. In this paper we propose a null space based feature selection method for gene expression data in terms of supervised classification. The proposed method discards the redundant genes by applying the information of null space of scatter matrices. We derive the method theoretically and demonstrate its effectiveness on several DNA gene expression datasets. The method is easy to implement and computationally efficient.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemical Research Methods

Single-stranded and double-stranded DNA-binding protein prediction using HMM profiles

Ronesh Sharma, Shiu Kumar, Tatsuhiko Tsunoda, Thirumananseri Kumarevel, Alok Sharma

Summary: DNA-binding proteins play essential roles in cellular processes, with single-stranded and double-stranded proteins classified based on their interactions with DNA. Computational prediction of these proteins aids in understanding their functions and binding domains. A proposed method using hidden Markov model profiles achieved improved performance compared to benchmark methods, with approximately 3% overall improvement.

ANALYTICAL BIOCHEMISTRY (2021)

Article Computer Science, Artificial Intelligence

OPTICAL+: a frequency-based deep learning scheme for recognizing brain wave signals

Shiu Kumar, Ronesh Sharma, Alok Sharma

Summary: This study introduces a frequency-based approach using LSTM network for recognizing different brain wave signals, incorporating adaptive filtering with genetic algorithm to achieve improved performance compared to existing methods.

PEERJ COMPUTER SCIENCE (2021)

Article Biochemical Research Methods

Critical assessment of protein intrinsic disorder prediction

Marco Necci, Damiano Piovesan, Silvio C. E. Tosatto

Summary: Intrinsically disordered proteins present a challenge to traditional protein structure-function analysis, with computational methods, particularly deep learning techniques, showing superior performance in predicting disorder. However, predicting disordered binding regions remains difficult, and there is a significant variation in computational times among methods.

NATURE METHODS (2021)

Article Biochemical Research Methods

Forecasting the spread of COVID-19 using LSTM network

Shiu Kumar, Ronesh Sharma, Tatsuhiko Tsunoda, Thirumananseri Kumarevel, Alok Sharma

Summary: The rapid spread of the COVID-19 pandemic globally has had a profound impact, making it crucial to predict when a country may be able to contain the virus. Researchers successfully forecasted the date when New Zealand contained the virus using a long short-term memory network model and have applied it to other countries as well.

BMC BIOINFORMATICS (2021)

Article Biochemical Research Methods

SPECTRA: a tool for enhanced brain wave signal recognition

Shiu Kumar, Tatsuhiko Tsunoda, Alok Sharma

Summary: The proposed SPECTRA predictor achieved the lowest average error rates and highest average kappa coefficient values compared to other methods, demonstrating its effectiveness in improving brain wave signal recognition for the development of computationally efficient real-time BCI systems.

BMC BIOINFORMATICS (2021)

Article Biochemical Research Methods

DeepFeature: feature selection in nonimage data using convolutional neural network

Alok Sharma, Artem Lysenko, Keith A. Boroevich, Edwin Vans, Tatsuhiko Tsunoda

Summary: Artificial intelligence methods, particularly deep neural networks such as convolutional neural networks, offer capabilities for discovering complex biological mechanisms from raw data. However, interpreting the results of these methods in a biomedical context remains a challenge. This study introduces an approach using CNN for nonimage data feature selection, showing promising results for predicting cancer types and identifying key pathways.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Chemistry, Medicinal

RNA Backbone Torsion and Pseudotorsion Angle Prediction Using Dilated Convolutional Neural Networks

Jaswinder Singh, Kuldip Paliwal, Jaspreet Singh, Yaoqi Zhou

Summary: The dilated convolutional neural network method SPOT-RNA-1D predicts RNA backbone torsion and pseudotorsion angles with smaller mean absolute errors compared to random and helix prediction methods. It accurately recovers overall patterns of angle distributions but faces difficulty in predicting angles further away from bases involved in tertiary interactions. SPOT-RNA-1D yields more accurate dihedral angles than the best models in RNA-puzzles experiments, showing potential as model quality indicators and restraints for RNA structure prediction.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2021)

Article Biochemical Research Methods

CluSem: Accurate clustering-based ensemble method to predict motor imagery tasks from multi-channel EEG data

Md Ochiuddin Miah, Rafsanjani Muhammod, Khondaker Abdullah Al Mamun, Dewan Md Farid, Shiu Kumar, Alok Sharma, Abdollah Dehzangi

Summary: This paper introduces a novel clustering-based ensemble technique called CluSem to enhance the classification performance of real-time BCI applications. A new brain game named CluGame is developed using this method to evaluate the classification performance of real-time motor imagery movements. Results show that CluSem improves classification accuracy by 5% to 15% compared to existing methods on collected and publicly available EEG datasets.

JOURNAL OF NEUROSCIENCE METHODS (2021)

Article Multidisciplinary Sciences

ACP-MHCNN: an accurate multi-headed deep-convolutional neural network to predict anticancer peptides

Sajid Ahmed, Rafsanjani Muhammod, Zahid Hossain Khan, Sheikh Adilina, Alok Sharma, Swakkhar Shatabda, Abdollah Dehzangi

Summary: In this study, a new multi-headed deep convolutional neural network model called ACP-MHCNN is proposed for extracting and combining discriminative features from different information sources in an interactive way to identify anticancer peptides. The model outperforms other existing models for anticancer peptide identification by a substantial margin, demonstrating higher accuracy, sensitivity, specificity, precision, and MCC.

SCIENTIFIC REPORTS (2021)

Article Multidisciplinary Sciences

A convolutional neural network based tool for predicting protein AMPylation sites from binary profile representation

Sayed Mehedi Azim, Alok Sharma, Iman Noshadi, Swakkhar Shatabda, Iman Dehzangi

Summary: AMPylation is an emerging post-translational modification that plays a role in neurodevelopment and neurodegeneration. However, there is a lack of computational approaches for predicting AMPylation due to a lack of peptide sequence datasets. In this study, a new dataset and machine learning tool called DeepAmp were introduced, achieving promising results in predicting AMPylation sites in proteins.

SCIENTIFIC REPORTS (2022)

Article Genetics & Heredity

CNN-Pred: Prediction of single-stranded and double-stranded DNA-binding protein using convolutional neural networks

Farnoush Manavi, Alok Sharma, Ronesh Sharma, Tatsuhiko Tsunoda, Swakkhar Shatabda, Iman Dehzangi

Summary: DNA-binding proteins play a vital role in biological activity including replication, packing, and reparation of DNA. They can be classified into single-stranded DNA-binding proteins (SSBs) or double-stranded DNA-binding proteins (DSBs) which help determine their function. Despite previous efforts, the prediction accuracy of DSB and SSB remains limited. In this study, a new method called CNN-Pred is proposed, which accurately predicts DSB and SSB using evolutionary-based features extracted from position specific scoring matrix (PSSM) with a 1D-convolutional neural network (CNN) as the classifier. The results show that CNN-Pred improves DSB and SSB prediction accuracies by more than 4% compared to previous studies. CNN-Pred is available as a standalone tool with its source codes on GitHub: https://github.com/MLBC-lab/CNN-Pred.
Article Biochemistry & Molecular Biology

CAID prediction portal: a comprehensive service for predicting intrinsic disorder and binding regions in proteins

Alessio Del Conte, Adel Bouhraoua, Mahta Mehdiabadi, Damiano Clementel, Alexander Miguel Monzon, Damiano CAID Predictors, Silvio C. E. Tosatto, Damiano Piovesan

Summary: Intrinsic disorder (ID) in proteins is a well-established phenomenon in structural biology, but measuring its behavior on a large scale is challenging. To address this issue, CAID benchmarks ID predictors and creates a web server, the CAID Prediction Portal, which executes all CAID methods on user-defined sequences. The server generates standardized output, facilitates comparison between methods, and provides a valuable resource for researchers studying ID in proteins.

NUCLEIC ACIDS RESEARCH (2023)

Article Multidisciplinary Sciences

DeepInsight-3D architecture for anti-cancer drug response prediction with deep-learning on multi-omics

Alok Sharma, Artem Lysenko, Keith A. Boroevich, Tatsuhiko Tsunoda

Summary: Modern oncology offers a wide range of treatments, and selecting the best option for each patient is crucial for optimal outcomes. Multi-omics profiling combined with AI-based predictive models show promise in streamlining treatment decisions, but are hindered by high dimensionality of datasets and limited annotated samples. Here, we propose a novel deep learning-based method, DeepInsight-3D, to predict patient-specific anticancer drug response using multi-omics data. This approach converts structured data into images and leverages convolutional neural networks to handle high dimensionality while modeling complex relationships between variables. DeepInsight-3D outperforms other state-of-the-art methods and has the potential to aid in the development of personalized treatment strategies for various cancers.

SCIENTIFIC REPORTS (2023)

Article Biochemical Research Methods

scDeepInsight: a supervised cell-type identification method for scRNA-seq data with deep learning

Shangru Jia, Artem Lysenko, Keith A. Boroevich, Alok Sharma, Tatsuhiko Tsunoda

Summary: Cell-type annotation is a critical step in analyzing scRNA-seq data. Most current methods use unsupervised clustering algorithms, resulting in rough classification. To address this issue, we propose scDeepInsight, a supervised annotation method that performs manifold assignments, data integration, supervised training, outlier detection, and cell-type annotation. It can also identify active genes related to cell-types.

BRIEFINGS IN BIOINFORMATICS (2023)

Article Computer Science, Information Systems

DeepLPC-MHANet: Multi-Head Self-Attention for Augmented Kalman Filter-Based Speech Enhancement

Sujan Kumar Roy, Aaron Nicolson, Kuldip K. Paliwal

Summary: The study investigates the use of MHANet for LPC estimation to reduce bias and improve speech enhancement quality, validated through subjective AB listening tests and seven objective measures.

IEEE ACCESS (2021)

暂无数据