4.3 Article

Ant colony optimization for text feature selection in sentiment analysis

Journal

INTELLIGENT DATA ANALYSIS
Volume 23, Issue 1, Pages 133-158

Publisher

IOS PRESS
DOI: 10.3233/IDA-173740

Keywords

Sentiment analysis; metaheuristic algorithm; ant colony optimization; k-nearest neighbour; text feature selection

Funding

  1. Universiti Pertahanan Nasional Malaysia
  2. Ministry of Education Malaysia
  3. Fundamental Research Grant Scheme [FRGS/1/2016/ICT02/UKM/01/2]

Ask authors/readers for more resources

In sentiment analysis, the high dimensionality of the feature vector is a key problem because it can decrease the accuracy of sentiment classification and make it difficult to obtain the optimum subset of features. To solve this problem, this study proposes a new text feature selection method that uses a wrapper approach, integrated with ant colony optimization (ACO) to guide the feature selection process. It also uses the k-nearest neighbour (KNN) as a classifier to evaluate and generate a candidate subset of optimum features. To test the subset of optimum features, algorithm dependency relations were used to find the relationship between the feature and the sentiment word in customer reviews. The output of the feature subset, which was derived using the proposed ACO-KNN algorithm, was used as an input to identify and extract sentiment words from sentences in customer reviews. The resulting relationship between features and sentiment words was tested and evaluated to determine the accuracy based on precision, recall, and F-score. The performance of the proposed ACO-KNN algorithm on customer review datasets was evaluated and compared with that of two hybrid algorithms from the literature, namely, the genetic algorithm with information gain and information gain with rough set attribute reduction. The results of the experiments showed that the proposed ACO-KNN algorithm was able to obtain the optimum subset of features and can improve the accuracy of sentiment classification.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Review Computer Science, Artificial Intelligence

A review of feature selection techniques in sentiment analysis

Siti Rohaidah Ahmad, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub

INTELLIGENT DATA ANALYSIS (2019)

Article Computer Science, Hardware & Architecture

A New Real-Time Link Prediction Method Based on User Community Changes in Online Social Networks

Amin Mahmoudi, Mohd Ridzwan Yaakub, Azuraliza Abu Bakar

COMPUTER JOURNAL (2020)

Article Computer Science, Artificial Intelligence

Hybrid N-gram model using Naive Bayes for classification of political sentiments on Twitter

Jamilu Awwalu, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub

NEURAL COMPUTING & APPLICATIONS (2019)

Article Computer Science, Information Systems

The Relationship between Online Social Network Ties and User Attributes

Amin Mahmoudi, Mohd Ridzwan Yaakub, Azuraliza Abu Bakar

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2019)

Review Computer Science, Information Systems

Movie Revenue Prediction Based on Purchase Intention Mining Using YouTube Trailer Reviews

Ibrahim Said Ahmad, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub

INFORMATION PROCESSING & MANAGEMENT (2020)

Review Multidisciplinary Sciences

A review of fake news detection approaches: A critical analysis of relevant studies and highlighting key challenges associated with the dataset, feature representation, and data fusion

Suhaib Kh Hamed, Mohd Juzaiddin Ab Aziz, Mohd Ridzwan Yaakub

Summary: Social networks have become the main source for news consumption, but the spread of fake news on these platforms has negative consequences. Many studies have proposed effective models for detecting fake news in social networks, but their accuracy is often insufficient. Previous reviews have focused on specific aspects of fake news detection models, overlooking the impact of datasets, features, and fusion methods. This review analyzes recent studies to highlight the challenges and performance implications of fake news detection models.

HELIYON (2023)

Proceedings Paper Computer Science, Cybernetics

The Development of Electroencephalogram (EEG) in Neuromarketing Using Hedonic and Utilitarian Motivation

Nurul Natasha Awinda Mohammad Nizam, Mohd Fahmi Mohamad Amran, Nurhafizah Moziyana Mohd Yusop, Siti Rohaidah Ahmad, Norshahriah Abdul Wahab

Summary: This paper examines the rapid growth of eCommerce in Malaysia, particularly during the Covid-19 pandemic, and explores how consumer behavior and decision making are influenced by emotions. It focuses on the emerging field of neuromarketing study, specifically using the electroencephalogram (EEG) technique. The paper aims to determine consumer behaviors towards marketing stimuli and how their emotions are influenced by these stimuli.

CYBERNETICS PERSPECTIVES IN SYSTEMS, VOL 3 (2022)

Article Computer Science, Information Systems

A Hybrid Metaheuristic Method in Training Artificial Neural Network for Bankruptcy Prediction

Abdollah Ansari, Ibrahim Said Ahmad, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub

IEEE ACCESS (2020)

Article Computer Science, Information Systems

A Temporal User Attribute-Based Algorithm to Detect Communities in Online Social Networks

Amin Mahmoudi, Azuraliza Abu Bakar, Mehdi Sookhak, Mohd Ridzwan Yaakub

IEEE ACCESS (2020)

Article Education & Educational Research

Identifying priority antecedents of educational data mining acceptance using importance-performance matrix analysis

Muslihah Wook, Suhaila Ismail, Nurhafizah Moziyana Mohd Yusop, Siti Rohaidah Ahmad, Arniyati Ahmad

EDUCATION AND INFORMATION TECHNOLOGIES (2019)

Article Computer Science, Theory & Methods

Product Feature Ranking and Popularity Model based on Sentiment Comments

Siti Rohaidah Ahmad, Nurhafizah Moziyana Mohd Yusop, Muslihah Wook, Arniyati Ahmad, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS (2018)

Article Computer Science, Theory & Methods

Beyond Sentiment Classification: A Novel Approach for Utilizing Social Media Data for Business Intelligence

Ibrahim Said Ahmad, Azuraliza Abu Bakar, Mohd Ridzwan Yaakub, Mohammad Darwich

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS (2020)

No Data Available