Review

A Review on Missing Value Imputation Algorithms for Microarray Gene Expression Data

Journal

CURRENT BIOINFORMATICS
Volume 9, Issue 1, Pages 18-22

Publisher

BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/1574893608999140109120957

Keywords

Gene expression analysis; gene expression data; information recovery; microarray data; missing value estimation; missing value imputation

Funding

  1. Malaysian Ministry of Higher Education [R.J130000.7807.4L096]
  2. Fundamental Research Grant Scheme [R.J130000.7807.4F190]
  3. Malaysian Ministry of Science, Technology and Innovation [06-0106-SF1029]

Many bioinformatics analytical tools, especially those for cancer classification and prediction, require a complete data matrix. Missing values in gene expression studies significantly influence the interpretation of the final data. Missing values are nonetheless a common problem, so missing value imputation algorithms have to be developed or refined to address it. This paper reviews the preferred and available missing value imputation methods for gene expression data. Focus is placed on the ability of the algorithms to exploit local or global data correlation when estimating the missing values. The approaches are categorized into global, local, hybrid, and knowledge-assisted approaches. The methods presented are accompanied by suitable performance evaluation. The aim of this review is to highlight possible improvements to existing research techniques rather than to recommend new algorithms with the same functional aim.
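To make the "local approach" category concrete, the following is a minimal sketch of nearest-neighbour imputation on a genes x samples matrix, together with one commonly used error measure (normalized root mean squared error). It is an illustration only and is not taken from the reviewed paper: the function names knn_impute and nrmse, the inverse-distance weighting, and the restriction of neighbours to rows with no missing values are all assumptions made for this example.

```python
import numpy as np


def knn_impute(X, k=3):
    """Fill NaNs in a genes x samples matrix from the k most similar rows.

    Hypothetical helper for illustration. Simplifying assumption: only rows
    without any missing value are eligible as neighbours.
    """
    X = np.asarray(X, dtype=float)
    filled = X.copy()
    missing = np.isnan(X)
    complete_rows = np.where(~missing.any(axis=1))[0]

    for i in np.where(missing.any(axis=1))[0]:
        obs = ~missing[i]
        if not obs.any() or complete_rows.size == 0:
            continue  # nothing to compare against; leave the NaNs in place
        # Root mean squared difference over the columns observed in row i.
        diff = X[complete_rows][:, obs] - X[i, obs]
        dist = np.sqrt((diff ** 2).mean(axis=1))
        order = np.argsort(dist)[:k]
        nearest = complete_rows[order]
        # Inverse-distance weights; the small constant avoids division by zero.
        weights = 1.0 / (dist[order] + 1e-12)
        for c in np.where(missing[i])[0]:
            filled[i, c] = np.average(X[nearest, c], weights=weights)
    return filled


def nrmse(truth, imputed, mask):
    """RMSE over the masked entries, divided by the std of their true values."""
    err = truth[mask] - imputed[mask]
    return np.sqrt(np.mean(err ** 2)) / np.std(truth[mask])


if __name__ == "__main__":
    # Synthetic data only: hide about 5% of the entries, impute, and score.
    rng = np.random.default_rng(0)
    truth = rng.normal(size=(50, 8))
    mask = rng.random(truth.shape) < 0.05
    X = truth.copy()
    X[mask] = np.nan
    print("NRMSE:", nrmse(truth, knn_impute(X, k=5), mask))
```

A global approach would instead estimate each missing entry from structure shared across the whole matrix, for example a low-rank approximation, rather than from a handful of similar rows. Hiding known entries and scoring the reconstruction, as in the demo, is one simple way to compare such methods on the same data.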

Recommended

Article Chemistry, Analytical

Deepint.net: A Rapid Deployment Platform for Smart Territories

Juan M. Corchado, Pablo Chamoso, Guillermo Hernandez, Agustin San Roman Gutierrez, Alberto Rivas Camacho, Alfonso Gonzalez-Briones, Francisco Pinto-Santos, Enrique Goyenechea, David Garcia-Retuerta, Maria Alonso-Miguel, Beatriz Bellido Hernandez, Diego Valdeolmillos Villaverde, Manuel Sanchez-Verdejo, Pablo Plaza-Martinez, Manuel Lopez-Perez, Sergio Manzano-Garcia, Ricardo S. Alonso, Roberto Casado-Vara, Javier Prieto Tejedor, Fernando de la Prieta, Sara Rodriguez-Gonzalez, Javier Parra-Dominguez, Mohd Saberi Mohamad, Saber Trabelsi, Enrique Diaz-Plaza, Jose Alberto Garcia-Coria, Tan Yigitcanlar, Paulo Novais, Sigeru Omatu

Summary: This paper presents an efficient cyberphysical platform for smart city management, which can utilize various data sources and includes a complete artificial intelligence suite, enabling adaptation to new requirements and running effective computational and artificial intelligence algorithms.

SENSORS (2021)

Article Neurosciences

SLC17A3 rs9379800 and Ischemic Stroke Susceptibility at the Northern Region of Malaysia

Shu Chai Ching, Lim Jing Wen, Nor Ismaliza Mohd Ismail, Irene Looi, Cheah Wee Kooi, Long Soo Peng, Lee Soon Mui, Jayashamani Tamibmaniam, Prema Muninathan, Ong Beng Hooi, Siti Maisarah Md Ali, Muhammad Radzi Abu Hassan, Mohd Saberi Mohamad, Lyn R. Griffiths, Loo Keat Wei

Summary: This study found that the SLC17A3 rs9379800 polymorphism and its gene expression are significantly associated with ischemic stroke risk among the Malay population in the Northern region of Malaysia. However, no significant associations were observed for PITX2, NINJ2, TWIST1, Rasip1, and MUT polymorphisms with ischemic stroke risk. Lower mRNA expression levels of Rasip1, SLC17A3, MUT and FERD3L were observed among cases.

JOURNAL OF STROKE & CEREBROVASCULAR DISEASES (2021)

Review Engineering, Chemical

A Review on Recent Progress in Machine Learning and Deep Learning Methods for Cancer Classification on Gene Expression Data

Aina Umairah Mazlan, Noor Azida Sahabudin, Muhammad Akmal Remli, Nor Syahidatul Nadiah Ismail, Mohd Saberi Mohamad, Hui Wen Nies, Nor Bakiah Abd Warif

Summary: This paper discusses the importance of data-driven models with predictive ability in medicine and healthcare, focusing on the application of machine learning (ML) and deep learning (DL) to cancer classification. While various methods have been applied to cancer classification, successful techniques mainly revolve around supervised and deep learning methods.

PROCESSES (2021)

Article Mathematical & Computational Biology

In silico gene knockout prediction using a hybrid of Bat algorithm and minimization of metabolic adjustment

Mei Yen Man, Mohd Saberi Mohamad, Yee Wen Choon, Mohd Arfian Ismail

Summary: Microorganisms commonly produce high-demand industrial products like fuels and vitamins, and microbial strains can be optimized through metabolic engineering, which includes gene knockout to maximize production rates.

JOURNAL OF INTEGRATIVE BIOINFORMATICS (2021)

Article Mathematical & Computational Biology

A hybrid of Bees algorithm and regulatory on/off minimization for optimizing lactate and succinate production

Mohd Izzat Yong, Mohd Saberi Mohamad, Yee Wen Choon, Weng Howe Chan, Hasyiya Karimah Adli, Khairul Nizar Syazwan Wsw, Nooraini Yusoff, Muhammad Akmal Remli

Summary: Metabolic engineering plays an important role in biomass production, particularly in microbial biomass production. In order to address the unrealistic flux distribution issue in prior work, a hybrid method of Bees Algorithm and Regulatory On/Off Minimization (BAROOM) was used.

JOURNAL OF INTEGRATIVE BIOINFORMATICS (2022)

Article Computer Science, Theory & Methods

A hybrid of ant colony optimization, genetic algorithm and flux balance analysis for optimization of succinic acid production in Escherichia coli

Jun Bin Tan, Yee Wen Choon, Kohbalan Moorthy, Hasyiya Karimah Adli, Muhammad Akmal Remli, Mohd Arfian Ismail, Zuwairie Ibrahim, Mohd Saberi Mohamad

Summary: This paper proposes a hybrid approach of ant colony optimization-genetic algorithm-flux balance analysis (ACOGAFBA) to enhance the succinic acid production of E. coli by identifying genes to be knocked out. The results show that ACOGAFBA can identify the set of knockout genes to improve succinic acid production in E. coli.

INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING (2023)

Review Chemistry, Analytical

Recent Advancements and Challenges of AIoT Application in Smart Agriculture: A Review

Hasyiya Karimah Adli, Muhammad Akmal Remli, Khairul Nizar Syazwan Wan Salihin Wong, Nor Alina Ismail, Alfonso Gonzalez-Briones, Juan Manuel Corchado, Mohd Saberi Mohamad

Summary: As the most popular technologies of the 21st century, AI and IoT have played a vital role in transforming the agricultural industry during the pandemic. The convergence of AI and IoT has sparked interest in AIoT, which significantly addresses challenges in agriculture such as pest management and post-harvest management. This paper presents a systematic literature review of AIoT, highlighting its current progress, applications, advantages, and challenges for adoption in modern agriculture.

SENSORS (2023)

Article Mathematical & Computational Biology

Artificial Bee Colony algorithm in estimating kinetic parameters for yeast fermentation pathway

Ahmad Muhaimin Ismail, Muhammad Akmal Remli, Yee Wen Choon, Nurul Athirah Nasarudin, Nor-Syahidatul N. Ismail, Mohd Arfian Ismail, Mohd Saberi Mohamad

Summary: Accurate kinetic parameters are necessary for analyzing metabolic pathways in systems biology. The fermentation pathway in the Saccharomyces cerevisiae kinetic model can be simulated to save time in the optimization process. Parameter estimation is conducted to obtain optimal values for parameters related to the fermentation process, which is essential to avoid erroneous conclusions. The Artificial Bee Colony algorithm (ABC) is proposed to estimate the parameters in the fermentation pathway, providing more accurate values compared to other estimation algorithms.

JOURNAL OF INTEGRATIVE BIOINFORMATICS (2023)

Review Pharmacology & Pharmacy

A review of SARS-CoV-2 drug repurposing: databases and machine learning models

Marim Elkashlan, Rahaf M. Ahmad, Malak Hajar, Fatma Al Jasmi, Juan Manuel Corchado, Nurul Athirah Nasarudin, Mohd Saberi Mohamad

Summary: The emergence of SARS-CoV-2 has posed a serious threat worldwide, calling for efficient solutions. Drug repurposing, particularly through machine learning, offers a promising approach to identify potential inhibitors. Reliable digital databases are important for data extraction in machine learning-based drug repurposing.

FRONTIERS IN PHARMACOLOGY (2023)

Review Computer Science, Artificial Intelligence

Offline Handwritten Chinese Character Using Convolutional Neural Network: State-of-the-Art Methods

Yingna Zhong, Kauthar Mohd Daud, Ain Najiha Binti Mohamad Nor, Richard Adeyemi Ikuesan, Kohbalan Moorthy

Summary: Handwritten character recognition is invaluable in society, especially for Chinese characters. Convolutional neural networks have achieved outstanding results in offline handwritten character recognition.

JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS (2023)

Review Computer Science, Information Systems

Review and Analysis of Patients' Body Language From an Artificial Intelligence Perspective

Sherzod Turaev, Saja Al-Dabet, Aiswarya Babu, Zahiriddin Rustamov, Jaloliddin Rustamov, Nazar Zaki, Mohd Saberi Mohamad, Chu Kiong Loo

Summary: Body language is a nonverbal form of communication that includes movements, postures, gestures, and expressions of the body. It expresses human feelings, thoughts, and intentions, and also reveals physical and psychological health conditions. The importance of studying the body language of people with health conditions is shown by various reports in the literature.

IEEE ACCESS (2023)

Proceedings Paper Computer Science, Cybernetics

A Hybrid of Bees Algorithm and Regulatory On/Off Minimization for Optimizing Lactate Production

Mohd Izzat Yong, Mohd Saberi Mohamad, Yee Wen Choon, Weng Howe Chan, Hasyiya Karimah Adli, Khairul Nizar W. S. W. Syazwan, Nooraini Yusoff, Muhammad Akmal Remli

Summary: Metabolic engineering is widely used for biomass production using microorganisms, and metabolic network models are employed for optimizing production and suggesting modifications. The BAROOM method, a hybrid of Bees Algorithm and Regulatory On/Off Minimization, improves lactate production in a model organism by identifying optimal genes to be knocked out, outperforming other methods.

PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, PACBB 2021 (2022)

Article Computer Science, Artificial Intelligence

Enhancement of Ethanol Production Using a Hybrid of Firefly Algorithm and Dynamic Flux Balance Analysis

Wan Ting Leong, Mohd Saberi Mohamad, Kohbalan Moorthy, Yee Wen Choon, Hasyiya Karimah Adli, Khairul Nizar W. S. W. Syazwan, Loo Keat Wei, Nazar Zaki

Summary: Metabolic engineering using microorganisms is a method to produce high-demand industrial products. This paper proposes a hybrid method of firefly algorithm and dynamic flux balance analysis to predict the gene knockout list for ethanol production.

INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH (2022)
