4.7 Article

Learning a discriminative SPD manifold neural network for image set classification

Journal

NEURAL NETWORKS
Volume 151, Pages 94-110

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.neunet.2022.03.012

Keywords

SPD manifold neural network; Image set classification; Metric learning; Riemannian barycenter; Riemannian optimization

Funding

  1. National Natural Science Foundation of China [62020106012, U1836218, 61672265, 62106089, 62006097]
  2. 111 Project of Ministry of Education of China [B12018]
  3. Natural Science Foundation of Jiangsu Province, China [BK20200593]
  4. Postgraduate Research & Practice Innovation Program of Jiangsu Province, China [KYCX21-2006]
  5. UK EPSRC [EP/N007743/1]
  6. MURI/EPSRC/DSTL, UK [MURI/EPSRC/DSTL]
  7. National Key Research and Development Program of China [UK EP/R018456/1]
  8. National Key Research and Development Program of China [2017YFC1601800]

This paper investigates pattern analysis on the symmetric positive definite (SPD) manifold and designs two Riemannian operation modules for SPD manifold neural networks. Experimental results demonstrate the effectiveness of the proposed approach.
Performing pattern analysis over the symmetric positive definite (SPD) manifold requires specific mathematical computations that characterize the non-Euclidean property of the data points and learning tasks involved, such as the image set classification problem. Together with advanced neural network techniques, several architectures for processing SPD matrices have recently been studied to obtain fine-grained structured representations. However, existing approaches are challenged by the diversely changing appearance of the data points, raising the question of how to learn invariant representations for improved performance with supporting theory. Therefore, this paper designs two Riemannian operation modules for SPD manifold neural networks. Specifically, a Riemannian batch regularization (RBR) layer is first proposed for training a discriminative manifold-to-manifold transforming network with a newly designed metric learning regularization term. The second module realizes the Riemannian pooling operation with geometric computations on Riemannian manifolds, notably the Riemannian barycenter, metric learning, and Riemannian optimization. Extensive experiments on five benchmark datasets show the efficacy of the proposed approach. (C) 2022 Published by Elsevier Ltd.
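
As a rough illustration of the Riemannian barycenter mentioned in the abstract, the sketch below averages SPD matrices under the Log-Euclidean metric, where the barycenter has a closed form (the matrix exponential of the mean of the matrix logarithms). The choice of metric and the function names `spd_log`, `sym_exp`, `log_euclidean_mean`, and `log_euclidean_distance` are assumptions made for this example. The abstract does not say which metric or algorithm the paper uses, and under other metrics (for example the affine-invariant one) the barycenter generally has to be computed iteratively.

```python
import numpy as np


def spd_log(S):
    """Matrix logarithm of an SPD matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(S)          # S = V diag(w) V^T with w > 0
    return (V * np.log(w)) @ V.T      # V diag(log w) V^T


def sym_exp(X):
    """Matrix exponential of a symmetric matrix; the result is SPD."""
    w, V = np.linalg.eigh(X)
    return (V * np.exp(w)) @ V.T


def log_euclidean_mean(mats):
    """Barycenter of SPD matrices under the Log-Euclidean metric (assumed here)."""
    logs = [spd_log(S) for S in mats]
    return sym_exp(np.mean(logs, axis=0))


def log_euclidean_distance(A, B):
    """Log-Euclidean distance: Frobenius norm of the difference of logarithms."""
    return np.linalg.norm(spd_log(A) - spd_log(B), ord="fro")


# Toy inputs (hypothetical): two diagonal SPD matrices.
A = np.diag([1.0, 4.0])
B = np.diag([4.0, 1.0])
M = log_euclidean_mean([A, B])        # approximately 2 * identity
```

Note that the arithmetic mean of `A` and `B` would be 2.5 times the identity, whereas the Log-Euclidean barycenter is approximately 2 times the identity and lies at equal Log-Euclidean distance from both inputs.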

Recommended

Article Automation & Control Systems

Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval

Donglin Zhang, Xiao-Jun Wu, Tianyang Xu, Josef Kittler

Summary: This paper proposes a novel two-stage supervised discrete hashing (TSDH) method to address the issues in existing cross-media hashing approaches. By generating latent representations and binary codes in a common hash space, and by directly endowing the hash codes with semantic labels and using a discrete hash optimization approach, the discriminative power of learned binary codes can be enhanced.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2022)

Article Engineering, Electrical & Electronic

FEXNet: Foreground Extraction Network for Human Action Recognition

Zhongwei Shen, Xiao-Jun Wu, Tianyang Xu

Summary: This paper proposes a Foreground EXtraction (FEX) block to disentangle foregrounds from the background for advanced action recognition systems. The FEX block contains a Foreground Enhancement (FE) module and a Scene Segregation (SS) module, which effectively models foreground clues and splits feature maps for action inference.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Artificial Intelligence

SymNet: A Simple Symmetric Positive Definite Manifold Deep Learning Method for Image Set Classification

Rui Wang, Xiao-Jun Wu, Josef Kittler

Summary: A SymNet network was proposed for image set classification, which utilized SPD matrix mapping layers, rectifying layers, pooling layers, and a log-map layer to achieve effective feature learning and data compression. PCA and KDA algorithms were applied for discriminative subspace learning, and extensive experiments validated the feasibility and effectiveness of the proposed SymNet.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Global Context-Aware Feature Extraction and Visible Feature Enhancement for Occlusion-Invariant Pedestrian Detection in Crowded Scenes

Zhenxing Liu, Xiaoning Song, Zhenhua Feng, Tianyang Xu, Xiaojun Wu, Josef Kittler

Summary: This paper discusses the progress and challenges in pedestrian detection research, proposes a method that enhances pedestrian detection by extracting effective features using contextual information, and validates the proposed method through experimental results on two benchmark datasets.

NEURAL PROCESSING LETTERS (2023)

Article Chemistry, Physical

Synergistic Multiple Bonds Induced Dynamic Self-Assembly of Silver Nanoclusters into Lamellar Frameworks with Tailored Luminescence

Yafang Hou, Yuqing Wang, Tianyang Xu, Zhi Wang, Weidong Tian, Di Sun, Xinyue Yu, Pengyao Xing, Jinglin Shen, Xia Xin, Jingcheng Hao

Summary: This article demonstrates a multi-bond-induced hierarchical self-assembly method, which utilizes atomically precise silver nanoclusters to achieve ordered layer-by-layer construction of metal-organic frameworks. The luminescence properties can be reversibly switched by tuning the pH values. This method has potential applications in the fields of luminescent devices and sensors.

CHEMISTRY OF MATERIALS (2022)

Article Computer Science, Artificial Intelligence

Target-Cognisant Siamese Network for Robust Visual Object Tracking

Yingjie Jiang, Xiaoning Song, Tianyang Xu, Zhenhua Feng, Xiaojun Wu, Josef Kittler

Summary: Siamese trackers have become the mainstream framework for visual object tracking in recent years. This paper proposes a target-cognisant Siamese network that enhances the interaction between the classification and regression branches, and introduces attention mechanisms and filtering modules to improve the tracking performance. Experimental results demonstrate the competitiveness of the proposed method.

PATTERN RECOGNITION LETTERS (2022)

Article Computer Science, Artificial Intelligence

Geometry-Aware Graph Embedding Projection Metric Learning for Image Set Classification

Rui Wang, Xiao-Jun Wu, Zhen Liu, Josef Kittler

Summary: This paper proposes a geometry-aware graph embedding projection metric learning algorithm to address the challenges of intraclass diversity and interclass similarity in image set classification. The algorithm constructs similarity graphs and utilizes local structural information on the Grassmann manifold for graph learning, and formulates the dimensionality reduction problem into a metric learning regularization term.

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

U-SPDNet: An SPD manifold learning-based neural network for visual classification

Rui Wang, Xiao-Jun Wu, Tianyang Xu, Cong Hu, Josef Kittler

Summary: This paper proposes a U-shaped neural network (U-SPDNet) based on SPD manifolds for visual classification. The U-SPDNet consists of an encoder and a decoder to extract and reconstruct image features, respectively, and addresses the degradation of structural information. Additionally, skip connections and geometric operations are employed to enhance the representational capacity of U-SPDNet, resulting in improved accuracy on multiple datasets.

NEURAL NETWORKS (2023)

Article Computer Science, Information Systems

Hybrid Riemannian Graph-Embedding Metric Learning for Image Set Classification

Ziheng Chen, Tianyang Xu, Xiao-Jun Wu, Rui Wang, Josef Kittler

Summary: With the increasing amount of video data, image set classification has become a popular topic in the field of computer vision and pattern recognition. However, the diversity within classes and ambiguity between classes pose a challenge. To address this, multiple geometry-aware image set modelling and learning methods have been proposed. In this paper, we propose a hybrid Riemannian metric learning framework that effectively fuses complementary kernel features obtained from different manifolds into a unified subspace for classification. Our approach achieves improved efficiency and outperforms state-of-the-art methods according to experimental results.

IEEE TRANSACTIONS ON BIG DATA (2023)

Article Computer Science, Artificial Intelligence

LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images

Hui Li, Tianyang Xu, Xiao-Jun Wu, Jiwen Lu, Josef Kittler

Summary: Deep learning based fusion methods have achieved promising performance in image fusion tasks due to the importance of network architecture. However, designing fusion networks is still a challenging task. In this paper, the fusion task is mathematically formulated and a connection between the optimal solution and network architecture is established. This leads to the proposal of a lightweight fusion network based on a learnable representation approach.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Learning Motion-Perceive Siamese network for robust visual object tracking

Ze Kang, Tianyang Xu, Xue-Feng Zhu, Xiao-Jun Wu

Summary: Traditional Siamese networks for visual tracking rely on offline-trained appearance models for each frame, disregarding temporal variation at the online stage. We propose a novel Motion-Perceive Siamese network (SiamMP) that explicitly predicts motion patterns to enhance appearance-only formulation.

PATTERN RECOGNITION LETTERS (2023)

Article Computer Science, Artificial Intelligence

Toward Robust Visual Object Tracking With Independent Target-Agnostic Detection and Effective Siamese Cross-Task Interaction

Tianyang Xu, Zhenhua Feng, Xiao-Jun Wu, Josef Kittler

Summary: In this study, a novel network with a target-agnostic object detection module is proposed to complement direct target inference and minimize the misalignment of key cues in potential template-instance matches. A cross-task interaction module is developed to ensure consistent supervision of classification and regression branches, improving their synergy. Adaptive labels are assigned to effectively supervise network training. Experimental results on various benchmarks demonstrate the effectiveness of the advanced target detection module and cross-task interaction, outperforming state-of-the-art tracking methods.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Artificial Intelligence

Discriminative Dictionary Pair Learning With Scale-Constrained Structured Representation for Image Classification

Zhe Chen, Xiao-Jun Wu, Tianyang Xu, Josef Kittler

Summary: The DPL-SCSR algorithm proposed in this article utilizes the label matrix of the dictionary to project the representation and approximate a block-diagonal structure by imposing non-negative constraint and controlling scale. It seamlessly integrates a linear classifier and feature extraction process, reducing training and parameter tuning complexity. Experimental results on image classification datasets show its superiority over state-of-the-art dictionary learning methods.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Reduced-complexity Convolutional Neural Network in the compressed domain

Hamdan Abdellatef, Lina J. Karam

Summary: This paper proposes performing the learning and inference processes in the compressed domain to reduce computational complexity and improve speed of neural networks. Experimental results show that modified ResNet-50 in the compressed domain is 70% faster than traditional spatial-based ResNet-50 while maintaining similar accuracy. Additionally, a preprocessing step with partial encoding is suggested to improve resilience to distortions caused by low-quality encoded images. Training a network with highly compressed data can achieve good classification accuracy with significantly reduced storage requirements.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Theoretical limits on the speed of learning inverse models explain the rate of adaptation in arm reaching tasks

Victor R. Barradas, Yasuharu Koike, Nicolas Schweighofer

Summary: Inverse models are essential for human motor learning as they map desired actions to motor commands. The shape of the error surface and the distribution of targets in a task play a crucial role in determining the speed of learning.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Learning a robust foundation model against clean-label data poisoning attacks at downstream tasks

Ting Zhou, Hanshu Yan, Jingfeng Zhang, Lei Liu, Bo Han

Summary: We propose a defense strategy that reduces the success rate of data poisoning attacks in downstream tasks by pre-training a robust foundation model.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for neural networks

Hao Sun, Li Shen, Qihuang Zhong, Liang Ding, Shixiang Chen, Jingwei Sun, Jing Li, Guangzhong Sun, Dacheng Tao

Summary: In this paper, the convergence rate of AdaSAM in the stochastic non-convex setting is analyzed. Theoretical proof shows that AdaSAM has a linear speedup property and decouples the stochastic gradient steps with the adaptive learning rate and perturbed gradient. Experimental results demonstrate that AdaSAM outperforms other optimizers in terms of performance.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Grasping detection of dual manipulators based on Markov decision process with neural network

Juntong Yun, Du Jiang, Li Huang, Bo Tao, Shangchun Liao, Ying Liu, Xin Liu, Gongfa Li, Disi Chen, Baojia Chen

Summary: In this study, a dual manipulator grasping detection model based on the Markov decision process is proposed. By parameterizing the grasping detection model of dual manipulators using a cross entropy convolutional neural network and a full convolutional neural network, stable grasping of complex multiple objects is achieved. Robot grasping experiments were conducted to verify the feasibility and superiority of this method.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Asymmetric double networks mutual teaching for unsupervised person Re-identification

Miaohui Zhang, Kaifang Li, Jianxin Ma, Xile Wang

Summary: This paper proposes an unsupervised person re-identification (Re-ID) method that uses two asymmetric networks to generate pseudo-labels for each other by clustering and updates and optimizes the pseudo-labels through alternate training. It also designs similarity compensation and similarity suppression based on the camera ID of pedestrian images to optimize the similarity measure. Extensive experiments show that the proposed method achieves superior performance compared to state-of-the-art unsupervised person re-identification methods.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Low-variance Forward Gradients using Direct Feedback Alignment and momentum

Florian Bacho, Dominique Chu

Summary: This paper proposes a new approach called the Forward Direct Feedback Alignment algorithm for supervised learning in deep neural networks. By combining activity-perturbed forward gradients, direct feedback alignment, and momentum, this method achieves better performance and convergence speed compared to other local alternatives to backpropagation.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Maximum margin and global criterion based-recursive feature selection

Xiaojian Ding, Yi Li, Shilin Chen

Summary: This research paper addresses the limitations of recursive feature elimination (RFE) and its variants in high-dimensional feature selection tasks. The proposed algorithms, which introduce a novel feature ranking criterion and an optimal feature subset evaluation algorithm, outperform current state-of-the-art methods.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Mental image reconstruction from human brain activity: Neural decoding of mental imagery via deep neural network-based Bayesian estimation

Naoko Koide-Majima, Shinji Nishimoto, Kei Majima

Summary: Visual images observed by humans can be reconstructed from brain activity, and the visualization of arbitrary natural images from mental imagery has been achieved through an improved method. This study provides a unique tool for directly investigating the subjective contents of the brain.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Hierarchical attention network with progressive feature fusion for facial expression recognition

Huanjie Tao, Qianyue Duan

Summary: In this paper, a hierarchical attention network with progressive feature fusion is proposed for facial expression recognition (FER), addressing the challenges posed by pose variation, occlusions, and illumination variation. The model achieves enhanced performance by aggregating diverse features and progressively enhancing discriminative features.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

SLAPP: Subgraph-level attention-based performance prediction for deep learning models

Zhenyi Wang, Pengfei Yang, Linwei Hu, Bowen Zhang, Chengmin Lin, Wenkai Lv, Quan Wang

Summary: In the face of the complex landscape of deep learning, we propose a novel subgraph-level performance prediction method called SLAPP, which combines graph and operator features through an innovative graph neural network called EAGAT, providing accurate performance predictions. In addition, we introduce a mixed loss design with dynamic weight adjustment to improve predictive accuracy.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

LDCNet: Lightweight dynamic convolution network for laparoscopic procedures image segmentation

Yiyang Yin, Shuangling Luo, Jun Zhou, Liang Kang, Calvin Yu-Chian Chen

Summary: Medical image segmentation is crucial for modern healthcare systems, especially in reducing surgical risks and planning treatments. Transanal total mesorectal excision (TaTME) has become an important method for treating colon and rectum cancers. Real-time instance segmentation during TaTME surgeries can assist surgeons in minimizing risks. However, the dynamic variations in TaTME images pose challenges for accurate instance segmentation.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Start-stop points CenterNet for wideband signals detection and time-frequency localization in spectrum sensing

Teng Cheng, Lei Sun, Junning Zhang, Jinling Wang, Zhanyang Wei

Summary: This study proposes a scheme that combines the start-stop point signal features for wideband multi-signal detection, called Fast Spectrum-Size Self-Training network (FSSNet). By utilizing start-stop points to build the signal model, this method successfully solves the difficulty of existing deep learning methods in detecting discontinuous signals and achieves satisfactory detection speed.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Learning deep representation and discriminative features for clustering of multi-layer networks

Wenming Wu, Xiaoke Ma, Quan Wang, Maoguo Gong, Quanxue Gao

Summary: The layer-specific modules in multi-layer networks are critical for understanding the structure and function of the system. However, existing methods fail to accurately characterize and balance the connectivity and specificity of these modules. To address this issue, a joint learning graph clustering algorithm (DRDF) is proposed, which learns the deep representation and discriminative features of the multi-layer network, and balances the connectivity and specificity of the layer-specific modules through joint learning.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Boundary uncertainty aware network for automated polyp segmentation

Guanghui Yue, Guibin Zhuo, Weiqing Yan, Tianwei Zhou, Chang Tang, Peng Yang, Tianfu Wang

Summary: This paper proposes a novel boundary uncertainty aware network (BUNet) for precise and robust colorectal polyp segmentation. BUNet utilizes a pyramid vision transformer encoder to learn multi-scale features and incorporates a boundary exploration module (BEM) and a boundary uncertainty aware module (BUM) to handle boundary areas. Experimental results demonstrate that BUNet outperforms other methods in terms of performance and generalization ability.

NEURAL NETWORKS (2024)