Article

Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis

NEUROCOMPUTING (2017)

Journal

NEUROCOMPUTING

Volume 241, Issue -, Pages 18-27

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2017.01.079

Keywords

Sentiment analysis; Recursive autoencoder; HowNet lexicon; Phrase structure tree

Categories

Computer Science, Artificial Intelligence

Funding

National Natural Science Foundation of China [61472258, 61402294]
National Key Technology Research and Development Program of the Ministry of Science and Technology of China [2014BAH28F05]
Science and Technology Foundation of Shenzhen City [JCYJ20140509172609162, JCYJ20130329102032059]

Abstract

Detecting the sentiment of sentences in online reviews remains a challenging task. Traditional machine learning methods often use bag-of-words representations, which cannot properly capture complex linguistic phenomena in sentiment analysis. Recently, recursive autoencoder (RAE) methods have been proposed for sentence-level sentiment analysis. They use word embeddings to represent each word, and learn compositional vector representations of phrases and sentences with recursive autoencoders. Although RAE methods outperform other state-of-the-art sentiment prediction approaches on commonly used datasets, they tend to generate very deep parse trees and need a large amount of labeled data for each node when learning compositional vector representations. Furthermore, RAE methods mainly combine adjacent words in sequence with a greedy strategy, which makes it difficult to capture semantic relations between distant words. To solve these issues, we propose a semi-supervised method that combines the HowNet lexicon to train phrase recursive autoencoders (we call it CHL-PRAE). CHL-PRAE first constructs the phrase recursive autoencoder (PRAE) model. The model then calculates the sentiment orientation of each node with the HowNet lexicon, and these orientations act as sentiment labels when the softmax classifier of PRAE is trained. In addition, CHL-PRAE conducts bidirectional training to capture global information. Compared with RAE and supervised methods such as support vector machine (SVM) and naive Bayes on English and Chinese datasets, the experimental results show that CHL-PRAE provides the best performance for sentence-level sentiment analysis. © 2017 Elsevier B.V. All rights reserved.
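
The two ideas in the abstract, composing phrase vectors along a phrase structure tree with an autoencoder and attaching a lexicon-derived sentiment label to each node, can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how the HowNet orientation is computed, so the counting rule in `lexicon_label`, the toy word lists `POSITIVE` and `NEGATIVE` (which are not HowNet entries), the embedding size `DIM`, the random untrained weights, and the hand-written tree are all assumptions made for illustration. Training, the softmax classifier, and the bidirectional pass are omitted.

```python
import math
import random

DIM = 4  # toy embedding size (assumption, not from the paper)

# Toy stand-in for a sentiment lexicon; these are NOT actual HowNet entries.
POSITIVE = {"good", "great"}
NEGATIVE = {"bad", "boring"}


def random_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these by training."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def affine(matrix, bias, vec):
    """Compute matrix @ vec + bias using plain Python lists."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(matrix, bias)]


def lexicon_label(words):
    """Hypothetical rule: sign of (positive hits minus negative hits)."""
    score = sum((w in POSITIVE) - (w in NEGATIVE) for w in words)
    return (score > 0) - (score < 0)  # +1, 0, or -1


def encode(tree, embeddings, params, nodes):
    """Recursively encode a binary phrase tree given as nested 2-tuples.

    Returns (vector, words) and appends one record per internal node
    to `nodes`, in post-order.
    """
    if isinstance(tree, str):  # leaf: look up the word embedding
        return embeddings[tree], [tree]
    left_vec, left_words = encode(tree[0], embeddings, params, nodes)
    right_vec, right_words = encode(tree[1], embeddings, params, nodes)
    children = left_vec + right_vec  # concatenation [left; right]
    # Encoder: parent = tanh(W_enc [left; right] + b_enc)
    parent = [math.tanh(z)
              for z in affine(params["W_enc"], params["b_enc"], children)]
    # Decoder: try to reconstruct both children from the parent
    recon = affine(params["W_dec"], params["b_dec"], parent)
    error = 0.5 * sum((c - r) ** 2 for c, r in zip(children, recon))
    words = left_words + right_words
    nodes.append({"words": words, "vector": parent,
                  "recon_error": error, "label": lexicon_label(words)})
    return parent, words


rng = random.Random(0)
vocab = ["the", "movie", "is", "great"]
embeddings = {w: [rng.uniform(-1.0, 1.0) for _ in range(DIM)] for w in vocab}
params = {
    "W_enc": random_matrix(DIM, 2 * DIM, rng), "b_enc": [0.0] * DIM,
    "W_dec": random_matrix(2 * DIM, DIM, rng), "b_dec": [0.0] * (2 * DIM),
}

# Hand-written phrase structure tree for "the movie is great" (assumption).
tree = (("the", "movie"), ("is", "great"))
nodes = []
root_vec, _ = encode(tree, embeddings, params, nodes)

for node in nodes:
    print(" ".join(node["words"]), "-> lexicon label", node["label"])
```

In a semi-supervised setup of the kind the abstract describes, per-node labels like these would replace manual annotations as the targets for a classifier on each node vector, alongside the reconstruction error.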
