4.6 Article

Multi-level region-based Convolutional Neural Network for image emotion classification

Journal

NEUROCOMPUTING
Volume 333, Issue -, Pages 429-439

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2018.12.053

Keywords

Image emotion classification; Multi-level; Region-based; Convolutional Neural Network

Ask authors/readers for more resources

Analyzing emotional information of visual content has attracted growing attention for the tendency of internet users to share their feelings via images and videos online. In this paper, we investigate the problem of affective image analysis, which is very challenging due to its complexity and subjectivity. Previous research reveals that image emotion is related to low-level to high-level visual features from both global and local view, while most of the current approaches only focus on improving emotion recognition performance based on single-level visual features from a global view. Aiming to utilize different levels of visual features from both global and local view, we propose a multi-level region-based Convolutional Neural Network (CNN) framework to discover the sentimental response of local regions. We first employ Feature Pyramid Network (FPN) to extract multi-level deep representations. Then, an emotional region proposal method is used to generate proper local regions and remove excessive non-emotional regions for image emotion classification. Finally, to deal with the subjectivity in emotional labels, we propose a multi-task loss function to take the probabilities of images belonging to different emotion classes into consideration. Extensive experiments show that our method outperforms the state-of-the-art approaches on various commonly used benchmark datasets. (C) 2018 Published by Elsevier B.V.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

LGAttNet: Automatic micro-expression detection using dual-stream local and global attentions

Madhumita A. Takalkar, Selvarajah Thuseethan, Sutharshan Rajasegarar, Zenon Chaczko, Min Xu, John Yearwood

Summary: Research in the field of micro-expressions has become increasingly important, with the proposal of an architecture called LGAttNet, which utilizes a dual attention network for frame-wise automatic micro-expression detection. This method extracts local and global facial features using an attention network, showing robustness and superiority compared to state-of-the-art approaches on publicly available databases.

KNOWLEDGE-BASED SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

GEME: Dual-stream multi-task GEnder-based micro-expression recognition

Xuan Nie, Madhumita A. Takalkar, Mengyang Duan, Haimin Zhang, Min Xu

Summary: This research introduces a dual-stream multi-task framework called GEME to improve micro-expression recognition accuracy by incorporating gender characteristics. By selecting gender-specific features of micro-expressions and adding them to the micro-expression features, the study shows an improvement in recognition accuracy. The network learns gender-specific and micro-expression features together to achieve a better representation.

NEUROCOMPUTING (2021)

Article Engineering, Electrical & Electronic

Noise Augmented Double-Stream Graph Convolutional Networks for Image Captioning

Lingxiang Wu, Min Xu, Lei Sang, Ting Yao, Tao Mei

Summary: The proposed NADGCN model utilizes grid-stream GCN as a supplement to the region stream and enhances the generalization of the language model by adding a noise module. Experimental results show that it outperforms the comparative baseline models.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Computer Science, Information Systems

Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning

Haimin Zhang, Min Xu

Summary: This research aims to extract frame-level features and emotion intensities by transferring emotional information from an image emotion dataset for video representation. An end-to-end network is proposed for joint emotion recognition and intensity learning with unsupervised adversarial adaptation. The experimental results show improved performance compared to previous state-of-the-art methods.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Information Systems

Multiscale Emotion Representation Learning for Affective Image Recognition

Haimin Zhang, Min Xu

Summary: In this paper, a region-based multiscale network is proposed for learning features for local affective regions and the broad context for affective image recognition. Experimental results show that integrating features from the broad context is effective in improving the recognition performance.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

Graph neural networks with multiple kernel ensemble attention

Haimin Zhang, Min Xu

Summary: The paper proposes a multiple kernel ensemble attention method for graph learning, which automatically learns the optimal kernel function and simplifies the hyperparameter tuning process, effectively improving graph learning performance.

KNOWLEDGE-BASED SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

An efficient multitask neural network for face alignment, head pose estimation and face tracking

Jiahao Xia, Haimin Zhang, Shiping Wen, Shuo Yang, Min Xu

Summary: The study introduces an efficient multitask neural network ATPN for face alignment, face tracking, and head pose estimation. ATPN improves performance in face alignment with shortcut connections and enhances other tasks with heatmap fusion, while also saving time in face detection and increasing real-time capability for video tasks.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

Learning Graph Representations Through Learning and Propagating Edge Features

Haimin Zhang, Jiahao Xia, Guoqiang Zhang, Min Xu

Summary: Graph convolutional networks have achieved considerable success in various graph domain tasks. However, the interrelation information between adjacent nodes is not well-considered in these models. This article presents a graph representation learning framework that generates improved node embeddings by learning and propagating edge features. Experimental results show that our model achieves improved performance compared to baseline models.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Robust Face Alignment via Inherent Relation Learning and Uncertainty Estimation

Jiahao Xia, Min Xu, Haimin Zhang, Jianguo Zhang, Wenjian Huang, Hu Cao, Shiping Wen

Summary: This paper proposes a novel face alignment framework, DSLPT, for learning the inherent relation and uncertainty estimation of facial landmarks. Unlike most existing methods that regress facial landmarks directly from global features, DSLPT first generates a rough representation of each landmark from a local patch and then adaptively aggregates them by a case dependent inherent relation. Moreover, a coarse-to-fine framework is introduced to gradually converge to the target facial landmarks.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

SSFG: Stochastically Scaling Features and Gradients for Regularizing Graph Convolutional Networks

Haimin Zhang, Min Xu, Guoqiang Zhang, Kenta Niwa

Summary: This paper proposes a stochastic regularization method to tackle the oversmoothing issue in graph convolutional networks. By stochastically scaling features and gradients, the convergence of features is alleviated and overall performance is improved. Experimental results on seven benchmark datasets for four graph-based tasks demonstrate the effectiveness of the method.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Information Systems

Weakly Supervised Emotion Intensity Prediction for Recognition of Emotions in Images

Haimin Zhang, Min Xu

Summary: Recognition of emotions in images is a topic of increasing research interest, and recent studies suggest that utilizing local region information can enhance recognition performance. A deep neural network structure leveraging emotion intensity learning has been proposed, showing improved performance in image emotion recognition.

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

Proceedings Paper Imaging Science & Photographic Technology

INFRARED TARGET DETECTION USING INTENSITY SALIENCY AND SELF-ATTENTION

Ruiheng Zhang, Min Xu, Yaxin Shi, Jian Fan, Chengpo Mu, Lixin Xu

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) (2020)

Article Computer Science, Information Systems

A Survey on Machine Learning Techniques for Cyber Security in the Last Decade

Kamran Shaukat, Suhuai Luo, Vijay Varadharajan, Ibrahim A. Hameed, Min Xu

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

LSTM-Cubic A * -based auxiliary decision support system in air traffic management

Zhiyuan Shi, Quan Pan, Min Xu

NEUROCOMPUTING (2020)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)