☆ 4.7 Article

High-precision linearized interpretation for fully connected neural network

APPLIED SOFT COMPUTING (2021)

期刊

APPLIED SOFT COMPUTING

卷 109, 期 -, 页码 -

出版社

ELSEVIER

DOI: 10.1016/j.asoc.2021.107572

关键词

Deep neural network; Convergent interpretation; Piecewise linear function; Sigmoid function

类别

Computer Science, Artificial Intelligence Computer Science, Interdisciplinary Applications

资金

National Key R&D Program of China [2018YFB0803700]
CERNET Innovation Project, China [NGII20180406]
Teaching Innovation Project of Communication University of China [JG21008]
Fundamental Research Funds for the Central Universities, China [CUC210A003]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The paper presents an interpretation scheme named CIDNN to provide a provably convergent and consistent interpretation for deep neural networks. By converting deep neural networks into Piecewise Linear Neural Networks and then equivalent linear classifiers, CIDNN's interpretation shows convergence and alignment with similar samples in synthetic datasets. Additionally, the semantic meaning of CIDNN was demonstrated in the Fashion-MNIST dataset.

Despite the widespread application of deep neural networks in finance, medical treatment, and autonomous driving, these networks face multiple security threats, such as maliciously constructed adversarial samples that can easily mislead deep neural network model classification, causing errors. Therefore, creating an interpretable model or designing an interpretation method is necessary to improve its security. This paper presents an interpretation scheme, named Convergent Interpretation for Deep Neural Networks (CIDNN), to obtain a provably convergent and consistent interpretation for deep neural networks. The main idea of CIDNN is to first convert the deep neural networks into a set of mathematically convergent Piecewise Linear Neural Networks (PLNN), then convert the PLNN into a set of equivalent linear classifiers. In this way, each linear classifier can be interpreted by its decision features. By analyzing the convergence of the local approximation interpretation scheme, we prove that this interpretable model can be sufficiently close to the deep neural network with certain conditions. Experiments show the convergence of CIDNN's interpretation, and the interpretation conforms with similar samples in the synthetic dataset. Besides, we demonstrate the semantical meaning of CIDNN in the Fashion-MNIST dataset. (C) 2021 Elsevier B.V. All rights reserved.

High-precision linearized interpretation for fully connected neural network

期刊

APPLIED SOFT COMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

High-precision linearized interpretation for fully connected neural network

期刊

APPLIED SOFT COMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文