4.7 Article

Towards explainable deep neural networks (xDNN)

期刊

NEURAL NETWORKS
卷 130, 期 -, 页码 185-194

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2020.07.010

关键词

Explainable AI; Interpretability; Prototype-based models; Deep-learning

向作者/读者索取更多资源

In this paper, we propose an elegant solution that is directly addressing the bottlenecks of the traditional deep learning approaches and offers an explainable internal architecture that can outperform the existing methods, requires very little computational resources (no need for GPUs) and short training times (in the order of seconds). The proposed approach, xDNN is using prototypes. Prototypes are actual training data samples (images), which are local peaks of the empirical data distribution called typicality as well as of the data density. This generative model is identified in a closed form and equates to the pdf but is derived automatically and entirely from the training data with no user- or problem-specific thresholds, parameters or intervention. The proposed xDNN offers a new deep learning architecture that combines reasoning and learning in a synergy. It is non-iterative and non-parametric, which explains its efficiency in terms of time and computational resources. From the user perspective, the proposed approach is clearly understandable to human users. We tested it on challenging problems as the classification of different lighting conditions for driving scenes (iROADS), object detection (Caltech-256, and Caltech-101), and SARS-CoV-2 identification via computed tomography scan (COVID CT-scans dataset). xDNN outperforms the other methods including deep learning in terms of accuracy, time to train and offers an explainable classifier. (C) 2020 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Automation & Control Systems

Self-Evolving Data Cloud-Based PID-Like Controller for Nonlinear Uncertain Systems

Zhao-Xu Yang, Hai-Jun Rong, Pak Kin Wong, Plamen Angelov, Zhi-Xin Yang, Hang Wang

Summary: The SEDCPID controller is constructed using fuzzy rules and data clouds, and has the advantage of evolving structure and simultaneously adapting parameters in an online manner.

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS (2021)

Article Computer Science, Artificial Intelligence

Explaining Deep Learning Models Through Rule-Based Approximation and Visualization

Eduardo Soares, Plamen P. Angelov, Bruno Costa, Marcos P. Gerardo Castro, Subramanya Nageshrao, Dimitar Filev

Summary: This article introduces a novel approach to developing explainable machine learning models by approximating a deep reinforcement learning model with IF-THEN rules and enhancing interpretability through visualizing rules. Experimental results demonstrate the effective interpretability of specific DRL agents and the potential extension to a broader set of deep neural network models.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

Detecting and learning from unknown by extremely weak supervision: exploratory classifier (xClass)

Plamen Angelov, Eduardo Soares

Summary: The paper introduces a new classification method and algorithm that can autonomously detect and learn new classes, with training guided by a minimal amount of labeled data samples. The algorithm automatically selects input features based on data density, generating an interpretable model based on data distribution prototypes.

NEURAL COMPUTING & APPLICATIONS (2021)

Article Chemistry, Analytical

Ensemble-Based Bounding Box Regression for Enhanced Knuckle Localization

Ritesh Vyas, Bryan M. Williams, Hossein Rahmani, Ricki Boswell-Challand, Zheheng Jiang, Plamen Angelov, Sue Black

Summary: The knuckle creases on the dorsal side of the hand can be used to identify offenders of serious crime when other recognizable biometric traits are not available. This paper proposes an ensemble approach using multiple object detector frameworks to accurately localize the knuckle regions. The effectiveness of the approach is tested on large-scale hand databases and its superiority over individual detectors is shown.

SENSORS (2022)

Article Computer Science, Artificial Intelligence

A Novel Multiple Feature-Based Engine Knock Detection System using Sparse Bayesian Extreme Learning Machine

Zhao-Xu Yang, Hai-Jun Rong, Pak Kin Wong, Plamen Angelov, Chi Man Vong, Chi Wai Chiu, Zhi-Xin Yang

Summary: This paper proposes an intelligent engine knock detection system based on engine vibration signals, utilizing VMD for signal filtering and IMF selection, GA for parameter optimization, and a multiple feature learning approach for feature extraction from denoised signals. The features are trained by SBELM to achieve a classification accuracy of 98.27%.

COGNITIVE COMPUTATION (2022)

Article Computer Science, Artificial Intelligence

A Self-Training Hierarchical Prototype-based Ensemble Framework for Remote Sensing Scene Classification

Xiaowei Gu, Ce Zhang, Qiang Shen, Jungong Han, Plamen P. Angelov, Peter M. Atkinson

Summary: A novel semi-supervised ensemble framework was proposed for remote sensing scene classification, utilizing a self-training hierarchical prototype-based classifier to address the challenges of labelled data scarcity and scene complexity. Experimental results demonstrated significant improvements in classification accuracy on popular benchmark datasets with limited labelled images available.

INFORMATION FUSION (2022)

Editorial Material Computer Science, Information Systems

Editorial: Special issue on recent progress in autonomous machine learning

Mahardhika Pratama, Edwin Lughofer, Plamen P. Angelov

INFORMATION SCIENCES (2022)

Article Computer Science, Artificial Intelligence

Statistically Evolving Fuzzy Inference System for Non-Gaussian Noises

Zhao-Xu Yang, Hai-Jun Rong, Plamen Angelov, Zhi-Xin Yang

Summary: This article proposes a novel incremental statistical evolving fuzzy inference system (SEFIS) that can update system parameters and evolve structure components in the presence of non-Gaussian noises. The system generates new rules based on statistical model sufficiency and deletes inactive rules to improve performance and accuracy. Additionally, an adaptive maximum correntropy extend Kalman filter is introduced to update parameters and enhance robustness. Simulation studies demonstrate that the proposed SEFIS has faster learning speed and higher accuracy compared to existing evolving fuzzy systems (EFSs) in both noise-free and noisy conditions.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Multiclass Fuzzily Weighted Adaptive-Boosting-Based Self-Organizing Fuzzy Inference Ensemble Systems for Classification

Xiaowei Gu, Plamen P. Angelov

Summary: This article introduces a novel multiclass fuzzily weighted AdaBoost-based ensemble system using a self-organizing fuzzy inference system as the ensemble component. By utilizing confidence scores from the SOFIS for sample weight updating and ensemble output generation, the proposed FWAdaBoost system achieves more accurate classification boundaries and greater prediction precision, demonstrating effectiveness in various benchmark classification problems.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Person identification from fingernails and knuckles images using deep learning features and the Bray-Curtis similarity measure

Mona Alghamdi, Plamen Angelov, Lopez Pellicer Alvaro

Summary: This paper presents an approach for person identification based on knuckle creases and fingernails. It introduces a framework that includes localization, recognition, segmentation, and similarity matching of hand components. The results show that knuckle patterns and fingernails play a significant role in person identification.

NEUROCOMPUTING (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Hand-Based Person Identification using Global and Part-Aware Deep Feature Representation Learning

Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

Summary: In cases of serious crime, especially sexual abuse, hand images are often the only available information for identification. However, analyzing these images is challenging due to their capture in uncontrolled situations. To address this issue, researchers propose a method that learns global and local feature representations for hand-based person identification. By creating global and local branches on the conv-layer, the method can learn robust discriminative features at both global and part-levels. Evaluations on large datasets demonstrate the significant superiority of the proposed method compared to other approaches.

2022 ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Graph-context Attention Networks for Size-varied Deep Graph Matching

Zheheng Jiang, Hossein Rahmani, Plamen Angelov, Sue Black, Bryan M. Williams

Summary: This study proposes a new method for matching images of different sizes, addressing the challenge through integer linear programming problems and graph-context attention networks. Experimental results demonstrate the superior performance of this method in keypoint matching and graph-level matching.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

Article Computer Science, Artificial Intelligence

A Semi-Supervised Deep Rule-Based Approach for Complex Satellite Sensor Image Analysis

Xiaowei Gu, Plamen P. Angelov, Ce Zhang, Peter M. Atkinson

Summary: Large-scale satellite sensor images are valuable but challenging data sources for Earth observation. This research proposes a semi-supervised deep rule-based approach (SeRBIA) for autonomous analysis and classification of these images into detailed land-use categories. SeRBIA achieves high accuracy and interpretability by continuously learning from both labelled and unlabelled images using an ensemble feature descriptor.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Automated Person Identification Framework Based on Fingernails and Dorsal Knuckle Patterns

Mona Alghamdi, Plamen Angelov, Bryan Williams

Summary: The study introduces a person identification method that utilizes knuckle creases and fingernail information from hand images. Results indicate that knuckle patterns and fingernails play a significant role in person identification, with fingernails showing slightly higher identification results compared to other hand components.

2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Stochastic Computing co-processing elements for Evolving Autonomous Data Partitioning

Alejandro Moran, Vincent Canals, Plamen P. Angelov, Christian F. Frasser, Erik S. Skibinsky-Gitlin, Joan Font, Eugeni Isern, Miquel Roca, Josep L. Rossello

Summary: This paper proposes a hardware acceleration technique using stochastic computing for the evolving ADP algorithm, showing potential benefits in reducing power consumption in embedded systems. Simulations of the proposed design on different datasets reveal some impact on clustering metrics compared to floating-point designs, with the potential for outperforming in certain cases while maintaining similar results to the original floating-point calculations.

2021 XXXVI CONFERENCE ON DESIGN OF CIRCUITS AND INTEGRATED SYSTEMS (DCIS21) (2021)

Article Computer Science, Artificial Intelligence

Reduced-complexity Convolutional Neural Network in the compressed domain

Hamdan Abdellatef, Lina J. Karam

Summary: This paper proposes performing the learning and inference processes in the compressed domain to reduce computational complexity and improve speed of neural networks. Experimental results show that modified ResNet-50 in the compressed domain is 70% faster than traditional spatial-based ResNet-50 while maintaining similar accuracy. Additionally, a preprocessing step with partial encoding is suggested to improve resilience to distortions caused by low-quality encoded images. Training a network with highly compressed data can achieve good classification accuracy with significantly reduced storage requirements.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Theoretical limits on the speed of learning inverse models explain the rate of adaptation in arm reaching tasks

Victor R. Barradas, Yasuharu Koike, Nicolas Schweighofer

Summary: Inverse models are essential for human motor learning as they map desired actions to motor commands. The shape of the error surface and the distribution of targets in a task play a crucial role in determining the speed of learning.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Learning a robust foundation model against clean-label data poisoning attacks at downstream tasks

Ting Zhou, Hanshu Yan, Jingfeng Zhang, Lei Liu, Bo Han

Summary: We propose a defense strategy that reduces the success rate of data poisoning attacks in downstream tasks by pre-training a robust foundation model.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for neural networks

Hao Sun, Li Shen, Qihuang Zhong, Liang Ding, Shixiang Chen, Jingwei Sun, Jing Li, Guangzhong Sun, Dacheng Tao

Summary: In this paper, the convergence rate of AdaSAM in the stochastic non-convex setting is analyzed. Theoretical proof shows that AdaSAM has a linear speedup property and decouples the stochastic gradient steps with the adaptive learning rate and perturbed gradient. Experimental results demonstrate that AdaSAM outperforms other optimizers in terms of performance.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Grasping detection of dual manipulators based on Markov decision process with neural network

Juntong Yun, Du Jiang, Li Huang, Bo Tao, Shangchun Liao, Ying Liu, Xin Liu, Gongfa Li, Disi Chen, Baojia Chen

Summary: In this study, a dual manipulator grasping detection model based on the Markov decision process is proposed. By parameterizing the grasping detection model of dual manipulators using a cross entropy convolutional neural network and a full convolutional neural network, stable grasping of complex multiple objects is achieved. Robot grasping experiments were conducted to verify the feasibility and superiority of this method.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Asymmetric double networks mutual teaching for unsupervised person Re-identification

Miaohui Zhang, Kaifang Li, Jianxin Ma, Xile Wang

Summary: This paper proposes an unsupervised person re-identification (Re-ID) method that uses two asymmetric networks to generate pseudo-labels for each other by clustering and updates and optimizes the pseudo-labels through alternate training. It also designs similarity compensation and similarity suppression based on the camera ID of pedestrian images to optimize the similarity measure. Extensive experiments show that the proposed method achieves superior performance compared to state-of-the-art unsupervised person re-identification methods.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Low-variance Forward Gradients using Direct Feedback Alignment and momentum

Florian Bacho, Dominique Chu

Summary: This paper proposes a new approach called the Forward Direct Feedback Alignment algorithm for supervised learning in deep neural networks. By combining activity-perturbed forward gradients, direct feedback alignment, and momentum, this method achieves better performance and convergence speed compared to other local alternatives to backpropagation.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Maximum margin and global criterion based-recursive feature selection

Xiaojian Ding, Yi Li, Shilin Chen

Summary: This research paper addresses the limitations of recursive feature elimination (RFE) and its variants in high-dimensional feature selection tasks. The proposed algorithms, which introduce a novel feature ranking criterion and an optimal feature subset evaluation algorithm, outperform current state-of-the-art methods.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Mental image reconstruction from human brain activity: Neural decoding of mental imagery via deep neural network-based Bayesian estimation

Naoko Koide-Majima, Shinji Nishimoto, Kei Majima

Summary: Visual images observed by humans can be reconstructed from brain activity, and the visualization of arbitrary natural images from mental imagery has been achieved through an improved method. This study provides a unique tool for directly investigating the subjective contents of the brain.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Hierarchical attention network with progressive feature fusion for facial expression recognition

Huanjie Tao, Qianyue Duan

Summary: In this paper, a hierarchical attention network with progressive feature fusion is proposed for facial expression recognition (FER), addressing the challenges posed by pose variation, occlusions, and illumination variation. The model achieves enhanced performance by aggregating diverse features and progressively enhancing discriminative features.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

SLAPP: Subgraph-level attention-based performance prediction for deep learning models

Zhenyi Wang, Pengfei Yang, Linwei Hu, Bowen Zhang, Chengmin Lin, Wenkai Lv, Quan Wang

Summary: In the face of the complex landscape of deep learning, we propose a novel subgraph-level performance prediction method called SLAPP, which combines graph and operator features through an innovative graph neural network called EAGAT, providing accurate performance predictions. In addition, we introduce a mixed loss design with dynamic weight adjustment to improve predictive accuracy.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

LDCNet: Lightweight dynamic convolution network for laparoscopic procedures image segmentation

Yiyang Yin, Shuangling Luo, Jun Zhou, Liang Kang, Calvin Yu-Chian Chen

Summary: Medical image segmentation is crucial for modern healthcare systems, especially in reducing surgical risks and planning treatments. Transanal total mesorectal excision (TaTME) has become an important method for treating colon and rectum cancers. Real-time instance segmentation during TaTME surgeries can assist surgeons in minimizing risks. However, the dynamic variations in TaTME images pose challenges for accurate instance segmentation.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

start-stop points CenterNet for wideband signals detection and time-frequency localization in spectrum sensing

Teng Cheng, Lei Sun, Junning Zhang, Jinling Wang, Zhanyang Wei

Summary: This study proposes a scheme that combines the start-stop point signal features for wideband multi-signal detection, called Fast Spectrum-Size Self-Training network (FSSNet). By utilizing start-stop points to build the signal model, this method successfully solves the difficulty of existing deep learning methods in detecting discontinuous signals and achieves satisfactory detection speed.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Learning deep representation and discriminative features for clustering of multi-layer networks

Wenming Wu, Xiaoke Ma, Quan Wang, Maoguo Gong, Quanxue Gao

Summary: The layer-specific modules in multi-layer networks are critical for understanding the structure and function of the system. However, existing methods fail to accurately characterize and balance the connectivity and specificity of these modules. To address this issue, a joint learning graph clustering algorithm (DRDF) is proposed, which learns the deep representation and discriminative features of the multi-layer network, and balances the connectivity and specificity of the layer-specific modules through joint learning.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Boundary uncertainty aware network for automated polyp segmentation

Guanghui Yue, Guibin Zhuo, Weiqing Yan, Tianwei Zhou, Chang Tang, Peng Yang, Tianfu Wang

Summary: This paper proposes a novel boundary uncertainty aware network (BUNet) for precise and robust colorectal polyp segmentation. BUNet utilizes a pyramid vision transformer encoder to learn multi-scale features and incorporates a boundary exploration module (BEM) and a boundary uncertainty aware module (BUM) to handle boundary areas. Experimental results demonstrate that BUNet outperforms other methods in terms of performance and generalization ability.

NEURAL NETWORKS (2024)