4.6 Article

From Feedforward to Recurrent LSTM Neural Networks for Language Modeling

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TASLP.2015.2400218

Keywords

Feedforward neural network; Kneser-Ney smoothing; language modeling; long short-term memory (LSTM); recurrent neural network (RNN)

Funding

  1. OSEO, French State agency for innovation
  2. Quaero programme
  3. European Union [287658, 287755]
  4. DIGITEO, a French research cluster in Ile-de-France
  5. JARA-HPC from RWTH Aachen University [jara0085]

Abstract

Language models have traditionally been estimated based on relative frequencies, using count statistics that can be extracted from huge amounts of text data. More recently, it has been found that neural networks are particularly powerful at estimating probability distributions over word sequences, giving substantial improvements over state-of-the-art count models. However, the performance of neural network language models strongly depends on their architectural structure. This paper compares count models to feedforward, recurrent, and long short-term memory (LSTM) neural network variants on two large-vocabulary speech recognition tasks. We evaluate the models in terms of perplexity and word error rate, experimentally validating the strong correlation of the two quantities, which we find to hold regardless of the underlying type of the language model. Furthermore, neural networks incur an increased computational complexity compared to count models, and they model context dependences differently, often taking more words into account than count-based approaches. These differences require efficient search methods for neural networks, and we analyze the potential improvements that can be obtained when applying advanced algorithms to the rescoring of word lattices on large-scale setups.
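
The abstract's first evaluation measure, perplexity, has a compact definition worth making explicit: it is the exponentiated average negative log-probability that the model assigns to the held-out words. The following minimal Python sketch (illustrative only, not code from the paper) computes it from per-word log-probabilities produced by any language model:

    import math

    def perplexity(log_probs):
        # Corpus perplexity from natural-log word probabilities:
        #   PPL = exp(-(1/N) * sum_i log p(w_i | history_i))
        # Lower perplexity means the model assigns higher probability
        # to the held-out word sequence.
        return math.exp(-sum(log_probs) / len(log_probs))

    # A model assigning probability 0.1 to each of 5 words has
    # perplexity 10: it is as uncertain as a uniform choice among
    # 10 words at every position.
    print(perplexity([math.log(0.1)] * 5))  # -> 10.0 (up to rounding)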

Authors

Martin Sundermeyer, Hermann Ney, Ralf Schlueter


Recommendations

Proceedings Paper Acoustics

CONFORMER-BASED HYBRID ASR SYSTEM FOR SWITCHBOARD DATASET

Mohammad Zeineldeen, Jingjing Xu, Christoph Luescher, Wilfried Michel, Alexander Gerstenberger, Ralf Schlueter, Hermann Ney

Summary: The paper investigates a Conformer-based hybrid model for ASR, exploring different training aspects and methods to improve performance. The results are competitive and show significant improvements over other architectures.

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2022)

Proceedings Paper Acoustics

ON LANGUAGE MODEL INTEGRATION FOR RNN TRANSDUCER BASED SPEECH RECOGNITION

Wei Zhou, Zuoyun Zheng, Ralf Schlueter, Hermann Ney

Summary: This paper studies LM integration methods based on internal language model (ILM) correction in the RNN-T framework. A decoding interpretation is provided, and two reasons for the performance improvement with ILM correction are verified experimentally. An exact-ILM training framework is also proposed that theoretically justifies other ILM approaches.

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2022)

Proceedings Paper Acoustics

IMPROVING FACTORED HYBRID HMM ACOUSTIC MODELING WITHOUT STATE TYING

Tina Raissi, Eugen Beck, Ralf Schlueter, Hermann Ney

Summary: This work introduces a factored hybrid hidden Markov model (FH-HMM) that outperforms a state-of-the-art hybrid HMM while dispensing with phonetic state-tying. The FH-HMM links to transducer models in how it models phonetic context, yet keeps the acoustic and language model components separate; it can be trained from scratch and exploits triphone context without any state-tying.

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2022)

Proceedings Paper Acoustics

EFFICIENT SEQUENCE TRAINING OF ATTENTION MODELS USING APPROXIMATIVE RECOMBINATION

Nils-Philipp Wynands, Wilfried Michel, Jan Rosendahl, Ralf Schlueter, Hermann Ney

Summary: Sequence discriminative training is an important tool for improving the performance of automatic speech recognition systems. However, computing the sum over all possible word sequences is impractical. Current state-of-the-art systems overcome this problem by limiting the summation to a selected number of relevant hypotheses obtained from beam search. This study proposes an approximate method of hypothesis recombination during beam search, which allows for a significant increase in the effective beam size without increasing computational requirements (a generic sketch of the recombination idea follows this entry).

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2022)
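
As a rough illustration of the recombination idea described in the entry above, the sketch below merges beam-search hypotheses whose recent label histories agree, summing their probabilities with log-sum-exp. This is a generic sketch under the assumption of a limited label context, not the paper's exact algorithm; `hyps`, `context_len`, and the tuple representation are illustrative choices:

    import math
    from collections import defaultdict

    def logsumexp(a, b):
        # Numerically stable log(exp(a) + exp(b)).
        if a == float("-inf"):
            return b
        if b == float("-inf"):
            return a
        hi, lo = max(a, b), min(a, b)
        return hi + math.log1p(math.exp(lo - hi))

    def recombine(hyps, context_len=3):
        # `hyps` maps label sequences (tuples) to log scores. Hypotheses
        # whose last `context_len` labels agree are merged: their
        # probabilities are summed and the best-scoring prefix is kept
        # as the representative, freeing beam slots for genuinely
        # distinct candidates.
        groups = defaultdict(list)
        for labels, score in hyps.items():
            groups[labels[-context_len:]].append((labels, score))
        merged = {}
        for group in groups.values():
            best_labels, _ = max(group, key=lambda g: g[1])
            total = float("-inf")
            for _, score in group:
                total = logsumexp(total, score)
            merged[best_labels] = total
        return merged

    # Both hypotheses end in ("cat", "sat"), so they collapse into one:
    beam = {("the", "cat", "sat"): -3.2, ("a", "cat", "sat"): -3.9}
    print(recombine(beam, context_len=2))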

Proceedings Paper Engineering, Biomedical

Discrete Steps towards Approximate Computing

Michael Gansen, Jie Lou, Florian Freye, Tobias Gemmeke, Farhad Merchant, Albert Zeyer, Mohammad Zeineldeen, Ralf Schlueter, Xin Fan

Summary: This paper presents recent studies on digital approximate computing, exploring discrete approximation using floating-point number representations and addressing time-domain computing. The proposed approximate arithmetic and nonlinear activation functions achieve competitive Quality-of-Service compared to full-precision computing in various artificial neural networks.

PROCEEDINGS OF THE TWENTY THIRD INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2022) (2022)

Proceedings Paper Audiology & Speech-Language Pathology

Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models

Mohammad Zeineldeen, Aleksandr Glushko, Wilfried Michel, Albert Zeyer, Ralf Schlueter, Hermann Ney

Summary: This study focuses on the implicit internal language model (ILM) within attention-based encoder-decoder models, proposing several methods to estimate the ILM directly and surpassing previous approaches. Additionally, other methods to suppress the ILM are explored, such as reducing model capacity, limiting label context, and training the model jointly with an existing LM.

INTERSPEECH 2021 (2021)

Proceedings Paper Audiology & Speech-Language Pathology

Librispeech Transducer Model with Internal Language Model Prior Correction

Albert Zeyer, Andre Merboldt, Wilfried Michel, Ralf Schlueter, Hermann Ney

Summary: Our study on the transducer model suggests that subtracting the estimated internal LM can lead to over 14% relative improvement over normal shallow fusion. The model has a separate probability distribution for non-blank labels, making it easier to combine with an external LM and to estimate the internal LM. Additionally, including the end-of-sentence (EOS) probability of the external LM in the last blank probability further improves performance (a schematic of the fused scoring follows this entry).

INTERSPEECH 2021 (2021)
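
The internal-LM subtraction described in the entry above amounts to a simple per-label score combination during decoding. The sketch below is schematic; the function name and the scale values are illustrative assumptions, not the paper's tuned settings:

    def fused_score(log_p_model, log_p_ext_lm, log_p_ilm,
                    lm_scale=0.6, ilm_scale=0.3):
        # Per-label decoding score for shallow fusion with internal-LM
        # correction:
        #   score = log p_model(y|x)            (transducer output)
        #         + lm_scale  * log p_extLM(y)  (external LM, shallow fusion)
        #         - ilm_scale * log p_ILM(y)    (subtract the model's own
        #                                        implicit label prior)
        # The scale values here are illustrative, not the paper's settings.
        return log_p_model + lm_scale * log_p_ext_lm - ilm_scale * log_p_ilm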

Proceedings Paper Audiology & Speech-Language Pathology

Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition

Wei Zhou, Mohammad Zeineldeen, Zuoyun Zheng, Ralf Schlueter, Hermann Ney

Summary: This paper introduces an acoustic data-driven subword modeling approach that produces labels suitable for various ASR models. Experimental results demonstrate that this approach outperforms traditional BPE and PASM methods in terms of performance.

INTERSPEECH 2021 (2021)

Proceedings Paper Audiology & Speech-Language Pathology

Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept

Wei Zhou, Albert Zeyer, Andre Merboldt, Ralf Schlueter, Hermann Ney

Summary: With the introduction of direct models in automatic speech recognition, the traditional frame-wise acoustic modeling based on hidden Markov models has diversified into various architectures. This work proves the equivalence of RNN-Transducer models and segmental models, providing initial experiments on decoding and beam-pruning using the same underlying model.

INTERSPEECH 2021 (2021)

Proceedings Paper Audiology & Speech-Language Pathology

The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech

Yu Qiao, Wei Zhou, Elma Kerz, Ralf Schlueter

Summary: This study focuses on the impact of using a state-of-the-art ASR system on the subsequent automatic analysis of linguistic complexity in spontaneously produced L2 speech. Through correlation analysis and controlling for task type effects, a more differentiated effect of ASR performance on specific types of complexity measures is presented.

INTERSPEECH 2021 (2021)

Proceedings Paper Audiology & Speech-Language Pathology

On Sampling-Based Training Criteria for Neural Language Modeling

Yingbo Gao, David Thulke, Alexander Gerstenberger, Khoa Viet Tran, Ralf Schlueter, Hermann Ney

Summary: With the increasing vocabulary size of language models, many sampling-based training criteria have been proposed and investigated, replacing the softmax traversal over the entire vocabulary to gain speedups. Contrary to common belief, experimental results show that all these sampling methods can perform equally well when corrected for the intended class posterior probabilities (a generic sampled-softmax sketch follows this entry).

INTERSPEECH 2021 (2021)
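
To make the posterior correction mentioned in the entry above concrete, the sketch below shows a generic sampled softmax in which the proposal log-probability log q(w) is subtracted from every candidate logit, keeping the sampled criterion consistent with the full softmax up to a constant. It is an illustration, not the paper's training code, and all names (`score`, `proposal`, `k`) are assumptions:

    import math
    import random

    def sampled_softmax_nll(score, target, vocab, proposal, k=64):
        # Negative log-likelihood of `target` under a sampled softmax.
        # `score(w)` is the model logit for word `w`; `proposal[w]` is the
        # probability q(w) under which negatives are drawn. Subtracting
        # log q(w) from every candidate logit is the correction that keeps
        # the sampled criterion consistent with the full-softmax class
        # posteriors (constant shifts cancel inside the softmax).
        negatives = random.choices(vocab,
                                   weights=[proposal[w] for w in vocab], k=k)
        candidates = [target] + [w for w in negatives if w != target]
        corrected = [score(w) - math.log(proposal[w]) for w in candidates]
        m = max(corrected)
        log_z = m + math.log(sum(math.exp(c - m) for c in corrected))
        return -(corrected[0] - log_z)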

Proceedings Paper Computer Science, Artificial Intelligence

COMPARING THE BENEFIT OF SYNTHETIC TRAINING DATA FOR VARIOUS AUTOMATIC SPEECH RECOGNITION ARCHITECTURES

Nick Rossenbach, Mohammad Zeineldeen, Benedikt Hilmes, Ralf Schlueter, Hermann Ney

Summary: Recent publications in automatic speech recognition focus on attention encoder-decoder (AED) architectures and the use of synthetic training data generated by TTS systems. The effectiveness of synthesized data for AED ASR training is influenced by factors such as pre-processing, speaker embedding, and internal language model subtraction. Hybrid ASR systems outperform AED systems on LibriSpeech-100h, achieving lower word error rates on the clean/noisy test sets.

2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

ON ARCHITECTURES AND TRAINING FOR RAW WAVEFORM FEATURE EXTRACTION IN ASR

Peter Vieting, Christoph Luescher, Wilfried Michel, Ralf Schlueter, Hermann Ney

Summary: This study investigates acoustic modeling with learned feature extractors operating on the raw waveform, explores the usefulness of unsupervised pre-training of feature extractors in ASR systems, compares the performance of different feature sets, and discusses how to further improve system performance.

2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) (2021)

Proceedings Paper Acoustics

PHONEME BASED NEURAL TRANSDUCER FOR LARGE VOCABULARY SPEECH RECOGNITION

Wei Zhou, Simon Berger, Ralf Schlueter, Hermann Ney

Summary: This study presents a phoneme-based neural transducer modeling approach that combines the advantages of classical and end-to-end methods by improving alignment label topologies, enhancing phoneme labels, and utilizing local phonetic dependencies along with external language models for sequence-to-sequence modeling consistency. The training procedure using frame-wise cross-entropy loss and a phonetic context size of one are shown to be efficient for achieving optimal performance.

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

TIGHT INTEGRATED END-TO-END TRAINING FOR CASCADED SPEECH TRANSLATION

Parnia Bahar, Tobias Bieschke, Ralf Schlueter, Hermann Ney

Summary: This study explores the feasibility of collapsing cascaded speech translation models into a single end-to-end trainable model by jointly optimizing all parameters of the ASR and MT models. Experimental results show that the model outperforms cascade and direct models, achieving better BLEU and TER scores.

2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT) (2021)
