4.4 Article

Using data-driven feature enrichment of text representation and ensemble technique for sentence-level polarity classification

期刊

JOURNAL OF INFORMATION SCIENCE
卷 41, 期 4, 页码 531-549

出版社

SAGE PUBLICATIONS LTD
DOI: 10.1177/0165551515585264

关键词

Sentiment analysis; sentence polarity classification; topic model; word embeddings

资金

  1. National Planning Office of Philosophy and Social Science [14CTQ026]

向作者/读者索取更多资源

As an important issue in sentiment analysis, sentence-level polarity classification plays a critical role in many opinion-mining applications such as opinion question answering, opinion retrieval and opinion summarization. Employing a supervised learning paradigm to train a classifier from sentences often faces the data sparseness problem owing to the short-length limit introduced to texts. In this article, regarding this problem, we exploit two different feature sets learned from external data sets as additional features to enrich data representation: one is a latent topic feature set obtained using a topic model, and the other is a related word feature set derived using word embeddings. Furthermore, we propose an ensemble approach by using these additional features to guide the design of different members of the ensemble. Experimental results on the public movie review dataset demonstrate that the enriched representations are effective for improving the performance of polarity classification, and the proposed ensemble approach can further improve the overall performance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据