4.6 Article

Attention-Based RU-BiLSTM Sentiment Analysis Model for Roman Urdu

Journal

APPLIED SCIENCES-BASEL
Volume 12, Issue 7, Pages -

Publisher

MDPI
DOI: 10.3390/app12073641

Keywords

sentiment analysis; deep learning; Roman Urdu; neural networks; LSTM; attention networks; text processing; natural language processing

Ask authors/readers for more resources

Deep neural networks are widely used in natural language processing tasks, but traditional deep networks fail to preserve contextual information in sequential data such as text. This paper proposes a deep recurrent architecture, RU-BiLSTM, for sentiment analysis of Roman Urdu. The model uses bidirectional LSTM to preserve context and an attention mechanism to focus on important features, achieving significant improvements over baseline models.
Deep neural networks have emerged as a leading approach towards handling many natural language processing (NLP) tasks. Deep networks initially conquered the problems of computer vision. However, dealing with sequential data such as text and sound was a nightmare for such networks as traditional deep networks are not reliable in preserving contextual information. This may not harm the results in the case of image processing where we do not care about the sequence, but when we consider the data collected from text for processing, such networks may trigger disastrous results. Moreover, establishing sentence semantics in a colloquial text such as Roman Urdu is a challenge. Additionally, the sparsity and high dimensionality of data in such informal text have encountered a significant challenge for building sentence semantics. To overcome this problem, we propose a deep recurrent architecture RU-BiLSTM based on bidirectional LSTM (BiLSTM) coupled with word embedding and an attention mechanism for sentiment analysis of Roman Urdu. Our proposed model uses the bidirectional LSTM to preserve the context in both directions and the attention mechanism to concentrate on more important features. Eventually, the last dense softmax output layer is used to acquire the binary and ternary classification results. We empirically evaluated our model on two available datasets of Roman Urdu, i.e., RUECD and RUSA-19. Our proposed model outperformed the baseline models on many grounds, and a significant improvement of 6% to 8% is achieved over baseline models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available