4.6 Article

Comparison Research on Text Pre-processing Methods on Twitter Sentiment Analysis

期刊

IEEE ACCESS
卷 5, 期 -, 页码 2870-2879

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2017.2672677

关键词

Twitter; sentiment analysis; text pre-processing

资金

  1. NSFC [1472316]
  2. Shaanxi Science and Technology [2016ZDJC-05, 2013SZS16-Z01/P01/K01]
  3. Fundamental Research Funds for Ministry of Education of China [XKJC2014008]

向作者/读者索取更多资源

Twitter sentiment analysis offers organizations ability to monitor public feeling towards the products and events related to them in real time. The first step of the sentiment analysis is the text preprocessing of Twitter data. Most existing researches about Twitter sentiment analysis are focused on the extraction of new sentiment features. However, to select the pre-processing method is ignored. This paper discussed the effects of text pre-processing method on sentiment classification performance in two types of classification tasks, and summed up the classification performances of six pre-processing methods using two feature models and four classifiers on five Twitter datasets. The experiments show that the accuracy and F1-measure of Twitter sentiment classification classifier are improved when using the pre-processing methods of expanding acronyms and replacing negation, but barely changes when removing URLs, removing numbers or stop words. The Naive Bayes and Random Forest classifiers are more sensitive than Logistic Regression and support vector machine classifiers when various pre-processing methods were applied.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Deep Convolution Neural Networks for Twitter Sentiment Analysis

Zhao Jianqiang, Gui Xiaolin, Zhang Xuejun

IEEE ACCESS (2018)

暂无数据