Article

Parsimonious memory unit for recurrent neural networks with application to natural language processing

Journal

NEUROCOMPUTING
Volume 314, Pages 48-64

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2018.05.081

Keywords

Artificial Intelligence; Recurrent neural networks; Speech and language processing; Classification

Recurrent Neural Networks (RNNs) have received considerable interest from Artificial Intelligence (AI) researchers over the last decade due to their ability to learn complex internal structures and expose relevant information. However, standard RNNs fail to capture long-term dependencies, and gated RNNs such as the Long Short-Term Memory (LSTM) have been proposed to address this drawback. This RNN-based model requires 4 gates to learn both short- and long-term dependencies for a given sequence of basic elements. More recently, a new family of RNN called the Gated Recurrent Unit (GRU) has been introduced. The GRU contains fewer gates (a reset and an update gate) but relies on gate grouping without taking into account the latent relations between short- and long-term dependencies. The GRU therefore manages term dependencies identically across all hidden neurons. Moreover, training gated RNNs requires a large amount of data and, despite GPU cards that speed up learning, the processing time remains costly. This paper proposes a new RNN called the Parsimonious Memory Unit (PMU), based on the strong assumption that short- and long-term dependencies are related and that each hidden neuron should play a different role to better handle term dependencies. Experiments conducted on a small (short-term) spoken-dialogue data set from the DECODA project, a large (long-term) textual document corpus from the 20-Newsgroups collection, and a language modeling task show that the proposed PMU-RNN reaches similar or even better performance with roughly 50% less processing time (improving portability). Moreover, the experiments on gate activity show that the proposed PMU manages term dependencies better than the GRU-RNN model. (C) 2018 Elsevier B.V. All rights reserved.
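For readers unfamiliar with the gating mechanism the abstract refers to, the following is a minimal NumPy sketch of a standard GRU cell (reset gate r, update gate z, in the usual Cho et al. 2014 formulation). It is provided only as background for the baseline the paper compares against; it is not the authors' PMU, whose equations are not given in the abstract. All variable and class names here are illustrative choices, not taken from the paper.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class GRUCell:
    """Minimal GRU cell with the two gates mentioned in the abstract:
    a reset gate r and an update gate z (standard formulation, not the PMU)."""

    def __init__(self, input_size, hidden_size, rng=np.random.default_rng(0)):
        s = 1.0 / np.sqrt(hidden_size)
        # One input and one recurrent weight matrix per gate, plus the candidate state.
        self.W_z = rng.uniform(-s, s, (hidden_size, input_size))
        self.U_z = rng.uniform(-s, s, (hidden_size, hidden_size))
        self.W_r = rng.uniform(-s, s, (hidden_size, input_size))
        self.U_r = rng.uniform(-s, s, (hidden_size, hidden_size))
        self.W_h = rng.uniform(-s, s, (hidden_size, input_size))
        self.U_h = rng.uniform(-s, s, (hidden_size, hidden_size))
        self.b_z = np.zeros(hidden_size)
        self.b_r = np.zeros(hidden_size)
        self.b_h = np.zeros(hidden_size)

    def step(self, x_t, h_prev):
        z = sigmoid(self.W_z @ x_t + self.U_z @ h_prev + self.b_z)   # update gate
        r = sigmoid(self.W_r @ x_t + self.U_r @ h_prev + self.b_r)   # reset gate
        h_tilde = np.tanh(self.W_h @ x_t + self.U_h @ (r * h_prev) + self.b_h)
        # Every hidden neuron mixes old state and candidate with the same rule;
        # this uniform handling of term dependencies is what the PMU aims to relax.
        return (1.0 - z) * h_prev + z * h_tilde

# Usage: run a short random sequence through the cell.
cell = GRUCell(input_size=8, hidden_size=16)
h = np.zeros(16)
for x_t in np.random.default_rng(1).normal(size=(5, 8)):
    h = cell.step(x_t, h)
print(h.shape)  # (16,)
```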
