4.2 Article

Ensemble Learning for Multidimensional Poverty Classification

Journal

SAINS MALAYSIANA
Volume 49, Issue 2, Pages 447-459

Publisher

UNIV KEBANGSAAN MALAYSIA
DOI: 10.17576/jsm-2020-4902-24

Keywords

Machine learning; multidimensional poverty; random forest

Funding

  1. UKM under the grand challenge LAB40 research grant [DCP-2017-015/1]

The poverty rate in Malaysia is determined through financial or income-based indices and measurements. Periodic measurements are conducted through the Household Expenditure and Income Survey (HEIS) twice every five years and are then used to generate a Poverty Line Income (PLI), from which poverty levels are determined through statistical methods. Such a uni-dimensional measurement, however, cannot portray overall deprivation conditions, especially as experienced by the urban population. In addition, the United Nations Development Programme (UNDP) has introduced a set of multidimensional poverty measurements, but these have yet to be applied in Malaysia. Machine Learning (ML) approaches that can produce new poverty measurement methods are therefore of interest, and they are made possible by the existence of rich poverty databases such as the eKasih database maintained by the Malaysian Government. The goal of this study was to determine whether an ensemble learning method (random forest) can classify poverty, and hence produce a multidimensional poverty indicator, compared with base learner methods on the eKasih dataset. The Cross-Industry Standard Process for Data Mining (CRISP-DM) was used to ensure that the data mining and ML processes were conducted properly. Besides random forest, we also examined decision tree and general linear methods to benchmark their performance and determine the method with the highest accuracy. Fifteen variables were then ranked using the varImp method to identify the important variables. The analysis showed that Per Capita Income, State, Ethnic, Strata, Religion, Occupation and Education were the most important variables in the classification of poverty, at a reported accuracy of 99% using the Random Forest algorithm.
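
The comparison described in the abstract (an ensemble against a single base learner, followed by a variable ranking) can be illustrated with a minimal, standard-library-only sketch. It is not the authors' pipeline. The data are synthetic rather than eKasih records, the "forest" is a heavily simplified bagged ensemble of depth-1 trees (stumps) with random feature subsets, and the importance score is just how often each feature is chosen for a split, a crude stand-in for varImp. All function names, the six toy features, and the labelling rule are assumptions made for illustration.

```python
import random
from collections import Counter

N_FEATURES = 6  # toy setup: features 0-2 drive the label, 3-5 are noise

def make_data(n, rng):
    """Synthetic records with a hypothetical labelling rule (1 = 'poor')."""
    rows = []
    for _ in range(n):
        x = [rng.random() for _ in range(N_FEATURES)]
        y = 1 if x[0] + x[1] + x[2] < 1.5 else 0
        rows.append((x, y))
    return rows

def fit_stump(rows, features):
    """Base learner: the single feature/threshold split with most correct labels."""
    best = None
    for f in features:
        for t in [i / 10 for i in range(1, 10)]:
            left = [y for x, y in rows if x[f] < t]
            right = [y for x, y in rows if x[f] >= t]
            if not left or not right:
                continue
            l_lab = Counter(left).most_common(1)[0][0]
            r_lab = Counter(right).most_common(1)[0][0]
            correct = left.count(l_lab) + right.count(r_lab)
            if best is None or correct > best[0]:
                best = (correct, f, t, l_lab, r_lab)
    return best[1:]  # (feature, threshold, left label, right label)

def predict_stump(stump, x):
    f, t, l_lab, r_lab = stump
    return l_lab if x[f] < t else r_lab

def fit_forest(rows, n_trees, n_sub, rng):
    """Ensemble: each stump sees a bootstrap sample and a random feature subset."""
    forest = []
    for _ in range(n_trees):
        sample = [rng.choice(rows) for _ in rows]
        feats = rng.sample(range(N_FEATURES), n_sub)
        forest.append(fit_stump(sample, feats))
    return forest

def predict_forest(forest, x):
    """Majority vote over all stumps."""
    votes = Counter(predict_stump(s, x) for s in forest)
    return votes.most_common(1)[0][0]

def accuracy(predict, rows):
    return sum(predict(x) == y for x, y in rows) / len(rows)

rng = random.Random(0)
train, test = make_data(400, rng), make_data(500, rng)

base = fit_stump(train, range(N_FEATURES))
forest = fit_forest(train, n_trees=101, n_sub=2, rng=rng)

base_acc = accuracy(lambda x: predict_stump(base, x), test)
forest_acc = accuracy(lambda x: predict_forest(forest, x), test)

# Crude importance: how many stumps split on each feature.
importance = Counter(s[0] for s in forest)
ranking = [f for f, _ in importance.most_common()]

print(f"base learner accuracy: {base_acc:.3f}")
print(f"ensemble accuracy:     {forest_acc:.3f}")
print("features ranked by split frequency:", ranking)
```

On this toy rule no single threshold can reproduce a label that depends on three features, so the vote over many stumps is expected to outperform the lone stump, and the three informative features should head the ranking. A real analysis would use full-depth trees, held-out or cross-validated evaluation, and an established importance measure.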

Recommended

Article Health Care Sciences & Services

Malay Version of the mHealth App Usability Questionnaire (M-MAUQ): Translation, Adaptation, and Validation Study

Norashikin Mustafa, Nik Shanita Safii, Aida Jaffar, Nor Samsiah Sani, Mohd Izham Mohamad, Abdul Hadi Abd Rahman, Sherina Mohd Sidik

Summary: The study translated and validated a Malay version of the mHealth App Usability Questionnaire (MAUQ) for future research and usage in Malaysia. The M-MAUQ demonstrated good reliability and validity, making it suitable for assessing the usability of mHealth apps in Malay.

JMIR MHEALTH AND UHEALTH (2021)

Article Environmental Sciences

Utilization of process network synthesis and machine learning as decision-making tools for municipal solid waste management

R. A. Ali, N. N. L. Nik Ibrahim, W. A. Wan Ab Karim Ghani, H. L. Lam, N. S. Sani

Summary: This study utilizes process network synthesis and data mining techniques as optimization models to evaluate the potential of decision-making in municipal solid waste management, with findings showing that the multilayer perceptron model performed well and can serve as a basis for decision-making in waste management. Integrating optimization models can provide an efficient tool for waste management decision-making.

INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY (2022)

Article Mathematics

Hybrid Symmetrical Uncertainty and Reference Set Harmony Search Algorithm for Gene Selection Problem

Salam Salameh Shreem, Mohd Zakree Ahmad Nazri, Salwani Abdullah, Nor Samsiah Sani

Summary: Selecting the most minimal set of genes from microarray datasets for clinical diagnosis and prediction is a challenging task in machine learning. This study proposes a gene selection method called SU-RSHSA that combines the advantages of the Symmetrical Uncertainty (SU) filter and Reference Set Harmony Search Algorithm (RSHSA) wrapper to generate a small subset of genes with high classification accuracy.

MATHEMATICS (2022)

Article Computer Science, Information Systems

Enhanced clustering models with wiki-based k-nearest neighbors-based representation for web search result clustering

Ali Sabah Abdulameer, Sabrina Tiun, Nor Samsiah Sani, Masri Ayob, Adil Yaseen Taha

Summary: Due to the overabundance of information on the web, existing clustering methods have limitations in clustering short texts. This study proposes an enhanced framework by expanding document terms to improve the clustering performance of web search results.

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES (2022)

Article Energy & Fuels

Energy-Aware Bag-of-Tasks Scheduling in the Cloud Computing System Using Hybrid Oppositional Differential Evolution-Enabled Whale Optimization Algorithm

Amit Chhabra, Sudip Kumar Sahana, Nor Samsiah Sani, Ali Mohammadzadeh, Hasmila Amirah Omar

Summary: A new optimization algorithm h-DEWOA was introduced to address the Cloud Bag-of-Tasks Scheduling (CBS) problem by enhancing the exploration ability and solution diversity of the Whale Optimization Algorithm (WOA), achieving superior scheduling solutions and demonstrating excellent performance in experiments.

ENERGIES (2022)

Article Computer Science, Information Systems

An Optimized Approach for Predicting Water Quality Features Based on Machine Learning

Nur Afyfah Suwadi, Morched Derbali, Nor Samsiah Sani, Meng Chun Lam, Haslina Arshad, Imran Khan, Ki-Il Kim

Summary: This study utilizes machine learning classification methods to predict water quality index (WQI) and identifies important features for prediction. The optimized Random Forest classifier with the WQI parameter selected by information gain achieved the highest performance. The study shows that the parameters oxygen (DO) and biochemical oxygen demand (BOD) are important features for predicting WQI. The proposed model has reasonable accuracy and minimal parameters, making it suitable for real-time water quality detection systems.

WIRELESS COMMUNICATIONS & MOBILE COMPUTING (2022)

Article Environmental Sciences

Water Quality Index Classification Based on Machine Learning: A Case from the Langat River Basin Model

Illa Iza Suhana Shamsuddin, Zalinda Othman, Nor Samsiah Sani

Summary: Traditionally, evaluating water quality has been expensive and ineffective for real-time monitoring. This study utilizes machine learning methods to construct a model capable of predicting water quality and finds that the Support Vector Machines (SVM) model performs the best in predicting river water quality. Additionally, the use of kernel functions, grid search methods, and multiclass classification techniques significantly impacts the effectiveness of the SVM model.

Article Chemistry, Multidisciplinary

Clustering Analysis for Classifying Student Academic Performance in Higher Education

Ahmad Fikri Mohamed Nafuri, Nor Samsiah Sani, Nur Fatin Aqilah Zainudin, Abdul Hadi Abd Rahman, Mohd Aliff

Summary: This study proposes a clustering-based approach to classify B40 students based on their performance in higher education institutions, aiming to assist the government in reducing dropout rates, increasing graduation rates, and boosting students' socioeconomic status.

APPLIED SCIENCES-BASEL (2022)

Article Medicine, General & Internal

Automatic Malignant and Benign Skin Cancer Classification Using a Hybrid Deep Learning Approach

Atheer Bassel, Amjed Basil Abdulkareem, Zaid Abdi Alkareem Alyasseri, Nor Samsiah Sani, Husam Jasim Mohammed

Summary: This article introduces the classification and identification methods for skin cancer, and proposes a classifier stacking method based on three-fold cross-validation. The method trains the system with deep learning and other machine learning methods in three levels on the training set, and achieves high accuracy on the test set.

DIAGNOSTICS (2022)

Article Engineering, Multidisciplinary

A Hybrid P-Graph And WEKA Approach In Decision-Making: Waste Conversion Technologies Selection

Rabiatul Adawiyah Ali, Nik Nor Liyana Nik Ibrahim, Wan Azlina Wan Abdul Karim Ghani, Nor Samsiah Sani, Hon Loong Lam

Summary: This study presents a decision-making integration framework based on hybrid process network synthesis and machine learning for equipment selection in municipal solid waste management. The P-graph is used to generate possible structures, and data from feasible structures are processed and evaluated using WEKA software. The J48 model is found to be the best for equipment selection with an 80:20 train and test learning technique. The framework is represented by a graphical user interface in MATLAB, focusing on the selection of waste conversion technologies.

JOURNAL OF APPLIED SCIENCE AND ENGINEERING (2022)

Article Computer Science, Information Systems

An Optimal Framework for SDN Based on Deep Neural Network

Abdallah Abdallah, Mohamad Khairi Ishak, Nor Samsiah Sani, Imran Khan, Fahad R. Albogamy, Hirofumi Amano, Samih M. Mostafa

Summary: This article proposes a novel DDoS traffic detection method based on information entropy and deep neural network (DNN). By calculating the information entropy value of data packets and using DNN for identification, it can accurately detect DDoS activity efficiently.

CMC-COMPUTERS MATERIALS & CONTINUA (2022)

Article Computer Science, Theory & Methods

Development of Pipe Inspection Robot using Soft Actuators, Microcontroller and LabVIEW

Mohd Aliff, Mohammad Imran, Sairul Izwan, Mohd Ismail, Nor Samsiah, So Shimooka, Tetsuya Akagi, Shujiro Dohta, Weihang Tian, Ahmad Athif

Summary: Pipeline transportation is crucial in today's world, and compact and portable pipe inspection robots with pneumatic actuators are needed. This study focuses on proposing mechanisms such as sliding, holding, and bending units to enable easy and efficient movement of robots in pipelines.

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS (2022)

Article Computer Science, Information Systems

A Sparse Optimization Approach for Beyond 5G mmWave Massive MIMO Networks

Waleed Shahjehan, Abid Ullah, Syed Waqar Shah, Imran Khan, Nor Samsiah Sani, Ki-Il Kim

Summary: This paper proposes an energy-efficient hybrid precoding algorithm based on RF chains selection for mmWave massive MIMO networks to reduce energy consumption and cost, and provide desirable quality-of-service. Simulation results show that the algorithm can effectively improve system performance under different operating conditions.

CMC-COMPUTERS MATERIALS & CONTINUA (2022)

Article Computer Science, Theory & Methods

A Regression Model to Predict Key Performance Indicators in Higher Education Enrollments

Ashraf Abdelhadi, Suhaila Zainudin, Nor Samsiah Sani

Summary: Performance indicators are crucial for organizational success as they measure current performance and track progress towards business objectives. This study utilized regression models to predict accurate KPIs based on student enrollment data, demonstrating that using linear regression with a 40% training and 60% testing split produced the best results.

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS (2022)

Article Computer Science, Theory & Methods

Machine Learning for Predicting Employee Attrition

Norsuhada Mansor, Nor Samsiah Sani, Mohd Aliff

Summary: In this study, the performance of machine learning techniques in predicting employee attrition was compared, with the optimized SVM model demonstrated as the best predictor with an accuracy rate of 88.87%. Various preprocessing steps and optimization techniques were applied to the dataset for analysis.

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS (2021)