☆ 4.7 Article

Finding the combination of multiple biomarkers to diagnose oral squamous cell carcinoma - A data mining approach

COMPUTERS IN BIOLOGY AND MEDICINE (2022)

Journal

COMPUTERS IN BIOLOGY AND MEDICINE

Volume 143, Issue -, Pages -

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.compbiomed.2022.105296

Keywords

Oral squamous cell carcinoma; Data mining; Feature selection; Metabolites; Salivary biomarkers; Machine learning

Funding

Sao Paulo Research Foundation [2016/08633-0]
National Council of Technological and Scientific Development (CNPq)
Coordination for the Improvement of Higher Education Personnel (CAPES)

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Data mining and machine learning algorithms have been successfully applied to analyze and discover useful knowledge about diseases, including oral squamous cell carcinoma. This study introduces a new approach that utilizes advanced data mining techniques to diagnose oral cancer and identify potential salivary biomarkers.

Data mining has proven to be a reliable method to analyze and discover useful knowledge about various diseases, including cancer research. In particular, data mining and machine learning algorithms to study oral squamous cell carcinoma (OSCC), the most common form of oral cancer, is a new area of research. This malignant neoplasm can be studied using saliva samples. Saliva is an important biofluid that must be used to verify potential biomarkers associated with oral cancer. In this study, first, we provide an overview of OSSC diagnoses based on machine learning and salivary metabolites. To our knowledge, this is the first study to apply advanced data mining techniques to diagnose OSCC. Then, we give new results of classification and feature selection algorithms used to identify potential salivary biomarkers of OSCC. To accomplish this task, we used the filter feature selection random forest importance algorithm and a wrapper methodology to evaluate the importance of metabolites obtained from gas chromatography mass-spectrometry (GC-MS) in the context of differentiation of OSCC and the control group. Salivary samples (n = 68) were collected for the control group, and the OSCC group were from patients matched for gender, age, and smoking habit. The classification process occurred based on Random Forest (RF) classification algorithm along with 10-cross validation. The results showed that glucuronic acid, maleic acid, and batyl alcohol can classify the samples with an area under the curve (AUC) of 0.91 versus an AUC of 0.76 using all 51 metabolites analyzed. The methodology used in this study can assist healthcare professionals and be adopted to discover diagnostic biomarkers for other diseases.

Finding the combination of multiple biomarkers to diagnose oral squamous cell carcinoma - A data mining approach

Journal

COMPUTERS IN BIOLOGY AND MEDICINE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Finding the combination of multiple biomarkers to diagnose oral squamous cell carcinoma - A data mining approach

Journal

COMPUTERS IN BIOLOGY AND MEDICINE

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper