☆ 4.6 Article

Interpreting Recurrent Neural Networks Behaviour via Excitable Network Attractors

COGNITIVE COMPUTATION (2020)

期刊

COGNITIVE COMPUTATION

卷 12, 期 2, 页码 330-356

出版社

SPRINGER

DOI: 10.1007/s12559-019-09634-2

关键词

Recurrent neural networks; Dynamical systems; Network attractors; Bifurcations

类别

Computer Science, Artificial Intelligence Neurosciences

资金

EPSRC [EP/N014391/1] Funding Source: UKRI

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Machine learning provides fundamental tools both for scientific research and for the development of technologies with significant impact on society. It provides methods that facilitate the discovery of regularities in data and that give predictions without explicit knowledge of the rules governing a system. However, a price is paid for exploiting such flexibility: machine learning methods are typically black boxes where it is difficult to fully understand what the machine is doing or how it is operating. This poses constraints on the applicability and explainability of such methods. Our research aims to open the black box of recurrent neural networks, an important family of neural networks used for processing sequential data. We propose a novel methodology that provides a mechanistic interpretation of behaviour when solving a computational task. Our methodology uses mathematical constructs called excitable network attractors, which are invariant sets in phase space composed of stable attractors and excitable connections between them. As the behaviour of recurrent neural networks depends both on training and on inputs to the system, we introduce an algorithm to extract network attractors directly from the trajectory of a neural network while solving tasks. Simulations conducted on a controlled benchmark task confirm the relevance of these attractors for interpreting the behaviour of recurrent neural networks, at least for tasks that involve learning a finite number of stable states and transitions between them.

Interpreting Recurrent Neural Networks Behaviour via Excitable Network Attractors

期刊

COGNITIVE COMPUTATION

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Interpreting Recurrent Neural Networks Behaviour via Excitable Network Attractors

期刊

COGNITIVE COMPUTATION

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文