4.5 Article

HMATC: Hierarchical multi-label Arabic text classification model using machine learning

期刊

EGYPTIAN INFORMATICS JOURNAL
卷 22, 期 3, 页码 225-237

出版社

CAIRO UNIV, FAC COMPUTERS & INFORMATION
DOI: 10.1016/j.eij.2020.08.004

关键词

Text classification; Multi-label classification; Hierarchical classification; Machine learning; Arabic natural language processing

资金

  1. Deanship of Scientific Research (DSR) , King Abdulaziz University, Jeddah, Saudi Arabia [DG 29-612-1440]

向作者/读者索取更多资源

This study investigates hierarchical multi-label classification in the context of the Arabic language, proposing a hierarchical multi-label Arabic text classification model with a machine learning approach. It examines the impact of feature selection methods and feature set dimensions on classification performance and optimizes the Hierarchy Of Multilabel ClassifiER (HOMER) algorithm. Results show that the proposed model outperforms existing models in terms of computational cost and various evaluation metrics.
Multi-label classification assigns multiple labels to each document concurrently. Many real-world classification problems tend to employ high-dimensional label spaces, which can be naturally structured in a hierarchy. In this type of problem, each instance may belong to multiple labels and labels are organized in a hierarchical structure. It presents a more complex problem than flat classification, given that the classification algorithm has to take into account hierarchical relationships between labels and be able to predict multiple labels for the same instance. Few studies have investigated multi-label text classification for the Arabic language. Most of these studies have focused mainly on flat classification and have neglected the hierarchical structure. Therefore, this paper explores the hierarchical multi-label classification in the context of the Arabic language. It proposes a hierarchical multi-label Arabic text classification (HMATC) model with a machine learning approach. The impact of feature selection methods and feature set dimensions on classification performance are also investigated. In addition, the Hierarchy Of Multilabel ClassifiER (HOMER) algorithm is optimized via examination of different sets of multi-label classifiers, clustering algorithms and different numbers of clusters to improve the hierarchical classification. Moreover, this study contributes to existing research by introducing a hierarchical multi-label Arabic dataset in an appropriate format for hierarchical classification and making it publicly available. The results reveal that the proposed model outperforms all models considered in the experiments in terms of the computational cost, which consumed less cost (2 h) compared with other evaluated models. In addition, it shows a significant improvement compared with the state-of-the-art model (Fatwa model) in terms of Hamming loss (0.004), hierarchical loss (1.723), multi-label accuracy (0.758), subset accuracy (0.292), micro-averaged precision (0.879), micro-averaged recall (0.828), and micro-averaged F-measure (0.853). (C) 2020 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Computers and Artificial Intelligence, Cairo University.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Telecommunications

Machine Learning Approaches for Anomaly Detection in IoT: An Overview and Future Research Directions

Nusaybah Alghanmi, Reem Alotaibi, Seyed M. Buhari

Summary: This paper reviews the relevant literature on anomaly detection techniques using various machine learning approaches in the IoT, analyzes the issues with different anomaly detection datasets, and lists future research directions in this field.

WIRELESS PERSONAL COMMUNICATIONS (2022)

Article Mathematics, Interdisciplinary Applications

APPLICATION OF NONLINEAR DYNAMIC EXPECTATION AND STOCHASTIC DIFFERENTIAL EQUATION IN VALUATION AND FINANCING RISK MEASUREMENT OF TECHNOLOGY-BASED SMALL AND MEDIUM-SIZED ENTERPRISES

Ximei Li, Reem Alotaibi

Summary: The purpose of this paper is to construct different risk measurement models by combining the financing risk of enterprises with the psychological factors of the consumer market, in order to evaluate the pricing and financing risks of technology-based SMEs. The findings show that the nonlinear expectation and stochastic differential equation can reflect changes in enterprise value and the impact of investor psychology on financing effectiveness. By applying the nonparametric estimation method, the accuracy of the model prediction can be improved.

FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY (2022)

Article Computer Science, Artificial Intelligence

Time series predicting of COVID-19 based on deep learning

Madini O. Alassafi, Mutasem Jarrah, Reem Alotaibi

Summary: The study developed a prediction model for the spread of COVID-19 in Malaysia, Morocco, and Saudi Arabia using public datasets from the European Centre for Disease Prevention and Control. Deep learning models were utilized with a focus on LSTM networks. The study also compared the number of cases and deaths in the three countries.

NEUROCOMPUTING (2022)

Article Environmental Sciences

Spatial Analysis of COVID-19 Vaccine Centers Distribution: A Case Study of the City of Jeddah, Saudi Arabia

Kamil Faisal, Sultanah Alshammari, Reem Alotaibi, Areej Alhothali, Omaimah Bamasag, Nusaybah Alghanmi, Manal Bin Yamin

Summary: The spatial distribution of vaccine centers is crucial for effective epidemic responses, and GIS analysis can be used to enhance coverage and efficiency. It is recommended to consider areas with broader coverage when allocating vaccine centers and to increase the number of centers to ensure fairness and equity in vaccine distribution.

INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH (2022)

Review Public, Environmental & Occupational Health

A Survey of Location-Allocation of Points of Dispensing During Public Health Emergencies

Nusaybah Alghanmi, Reem Alotaibi, Sultanah Alshammari, Areej Alhothali, Omaimah Bamasag, Kamil Faisal

Summary: This study presents a survey of the point of dispensing (PODs) location-allocation problem during public health emergencies. The survey analyzes existing models based on full and partial demand points allocation and compares them based on their features, strengths, and limitations. The study also discusses the challenges and future research directions for PODs location-allocation models. The results highlight the need for developing techniques to meet the demands of specific groups and to consider country-specific variations in population size and density.

FRONTIERS IN PUBLIC HEALTH (2022)

Article Computer Science, Artificial Intelligence

MACC Net: Multi-task attention crowd counting network

Sahar Aldhaheri, Reem Alotaibi, Bandar Alzahrani, Anas Hadi, Arif Mahmood, Areej Alhothali, Ahmed Barnawi

Summary: This paper proposes a multi-task attention based crowd counting network (MACC Net) to address the challenges in crowd density estimation. The network improves counting accuracy through density level classification, density map estimation, and segmentation guided attention. Experimental results on multiple datasets demonstrate that the MACC Net achieves state of the art performance in crowd counting.

APPLIED INTELLIGENCE (2023)

Article Computer Science, Information Systems

Anomalous event detection and localization in dense crowd scenes

Areej Alhothali, Amal Balabid, Reem Alharthi, Bander Alzahrani, Reem Alotaibi, Ahmed Barnawi

Summary: Recognizing and localizing anomalous events in crowd scenes is a challenging problem. This research aims to detect and locate anomalies in dense crowd scenes, proposing a method that combines deep learning with support vector machines.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Fully supervised contrastive learning in latent space for face presentation attack detection

Madini O. Alassafi, Muhammad Sohail Ibrahim, Imran Naseem, Rayed AlGhamdi, Reem Alotaibi, Faris A. Kateb, Hadi Mohsen Oqaibi, Abdulrahman A. Alshdadi, Syed Adnan Yusuf

Summary: The vulnerability of conventional face recognition systems to face presentation or face spoofing attacks has attracted attention. Deep learning-based face presentation attack detection (PAD) methods have gained popularity. This research proposes a supervised contrastive learning approach to tackle the face anti-spoofing problem.

APPLIED INTELLIGENCE (2023)

Article Computer Science, Information Systems

Hybrid Classifiers for Spatio-Temporal Abnormal Behavior Detection, Tracking, and Recognition in Massive Hajj Crowds

Tarik Alafif, Anas Hadi, Manal Allahyani, Bander Alzahrani, Areej Alhothali, Reem Alotaibi, Ahmed Barnawi

Summary: Individual abnormal behaviors vary depending on crowd sizes, contexts, and scenes. Challenges occur in large-scale crowds when detecting, tracking, and recognizing individuals with abnormalities. This paper introduces a large-scale crowd abnormal behavior dataset and proposes a method using hybrid CNNs and RFs to detect and recognize abnormal behaviors.

ELECTRONICS (2023)

Article Computer Science, Artificial Intelligence

A semi supervised approach to Arabic aspect category detection using Bert and teacher-student model

Miada Almasri, Norah Al-Malki, Reem Alotaibi

Summary: This research aims to enhance the capability of a deep learning model, AraBERT v02, for aspect category detection in the Arabic language. The study utilizes a semi-supervised self-training approach called the noisy student framework. Findings show that the ensembled teacher-student model outperforms baselines and other deep learning models in predicting aspect categories.

PEERJ COMPUTER SCIENCE (2023)

Article Engineering, Multidisciplinary

Radiation Dose Tracking in Computed Tomography Using Data Visualization

Reem Alotaibi, Felwa Abukhodair

Summary: Radiation dose tracking is important due to the popularity of CT scans. However, existing software programs for tracking doses have limitations and do not provide accurate answers. This paper proposes a visual analytic approach using Tableau software to track radiation dose data from CT scans, which had a 100% success rate in real-life scenarios and improved the tracking process.

TECHNOLOGIES (2023)

Article Computer Science, Information Systems

A Novel Deep Learning Architecture With Image Diffusion for Robust Face Presentation Attack Detection

Madini O. Alassafi, Muhammad Sohail Ibrahim, Imran Naseem, Rayed AlGhamdi, Reem Alotaibi, Faris A. Kateb, Hadi Mohsen Oqaibi, Abdulrahman A. Alshdadi, Syed Adnan Yusuf

Summary: Face presentation attack detection (PAD) is a crucial step in modern face recognition systems to expose imposters and unauthorized persons. This research proposes a novel face PAD solution using interpolation-based image diffusion and transfer learning of a MobileNet convolutional neural network. The experimental results show that the proposed method outperforms most state-of-the-art methods in terms of performance.

IEEE ACCESS (2023)

Article Mathematics, Applied

Research on management evaluation of enterprise sales cash flow percentage method based on the application of quadratic linear regression equations

Fanxiu Gao, Reem Alotaibi, Mohammed Yousuf Abo Keir

Summary: This article introduces an improved sales percentage method and uses SPSS for regression analysis to predict future sales revenue, calculate future net cash flow, and company value based on the predicted data.

APPLIED MATHEMATICS AND NONLINEAR SCIENCES (2022)

Article Mathematics, Applied

Topological optimisation technology of gravity dam section structure based on ANSYS partial differential equation operation

Xin Guan, Peng Yao, Reem Alotaibi, Mohammed Yousuf Abo Keir

Summary: This paper explains the numerical instability in engineering structure topology optimization using the finite element method. The Gaussian function filtering method is introduced to reduce the global impact of local extremum in the optimization process, and successfully applied in the topology optimization of building structures in hydraulic engineering.

APPLIED MATHEMATICS AND NONLINEAR SCIENCES (2022)

Article Computer Science, Information Systems

A Secure Key Agreement Scheme for Unmanned Aerial Vehicles-Based Crowd Monitoring System

Bander Alzahrani, Ahmed Barnawi, Azeem Irshad, Areej Alhothali, Reem Alotaibi, Muhammad Shafiq

Summary: Unmanned aerial vehicles (UAVs) have gained significant attention in civil and commercial applications, especially in the field of crowd monitoring. However, ensuring the security and privacy of communication between drones and controlling entities remains a critical challenge. This paper proposes an enhanced authenticated key agreement (AKA) solution for secure communication, and its effectiveness is demonstrated through simulation and verification.

CMC-COMPUTERS MATERIALS & CONTINUA (2022)

暂无数据