4.7 Article

Eye tracking data guided feature selection for image classification

期刊

PATTERN RECOGNITION
卷 63, 期 -, 页码 56-70

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2016.09.007

关键词

Eye tracking; Feature selection; Quantum genetic algorithm (QGA); mRMR; SVM-RFE

资金

  1. National Natural Science Foundation of China [60871086, 61473243]
  2. Natural Science Foundation of Jiangsu Province China [BK2008159]
  3. Natural Science Foundation of Suzhou [SYG201113]

向作者/读者索取更多资源

Feature selection has played a critical role in image classification, since it is able to remove irrelevant and redundant features and to eventually reduce the dimensionality of feature space. Although existing feature selection methods have achieved promising progress, human factors have seldom been taken into account. To tackle such a problem, a novel two-stage feature selection method is proposed for image classification by taking human factors into account and leveraging the value of eye tracking data. In the coarse selection stage, with the help of eye tracking data, Regions of Interests (ROIs) from the human perspective are first identified to represent an image with visual features. Then, with an improved quantum genetic algorithm (IQGA) that incorporates a novel mutation strategy for alleviating the premature convergence, a subset of features is obtained for the subsequent fine selection. In the fine selection stage, a hybrid method is proposed to integrate the efficiency of the minimal-Redundancy Maximal-Relevance (mRMR) and the effectiveness of the Support Vector Machine based Recursive Feature Elimination (SVM-RFE). In particular, the ranking criterion of the SVM-RFE is improved by incorporating the ranking information obtained from the mRMR. Comprehensive experimental results in two benchmark datasets demonstrate that eye tracking data are of great importance to improve the performance of feature selection for image classification. (C) 2016 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Interdisciplinary Applications

Short-Term Lesion Change Detection for Melanoma Screening With Novel Siamese Neural Network

Boyan Zhang, Zhiyong Wang, Junbin Gao, Chantal Rutjes, Kaitlin Nufer, Dacheng Tao, David Dagan Feng, Scott W. Menzies

Summary: Short-term monitoring of lesion changes in melanoma screening is currently heavily dependent on individual clinicians' experience and bias, leading to subjective decisions. This paper introduces a novel deep learning-based method for automatically detecting short-term lesion changes, using a Siamese structure and Tensorial Regression Process to improve accuracy. Experimental results on a large dataset show promising results for objective melanoma screening.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2021)

Article Computer Science, Hardware & Architecture

Enhanced Local and Global Learning for Rotation-Invariant Point Cloud Representation

Ruibin Gu, Qiuxia Wu, Yuqiong Li, Wenxiong Kang, Wing W. Y. Ng, Zhiyong Wang

Summary: In this paper, a novel rotation-invariant network named ELGANet is proposed to tackle the issues of rotation disturbance and insufficient labeled data. The ELGANet includes enhanced local representation learning module and global alignment module to capture geometric relationship and adaptively generate rotation-invariant coordinates. Besides, an unsupervised learning network ELGANet-U is also introduced to generate discriminative and rotation-invariant representation without human supervision.

IEEE MULTIMEDIA (2022)

Article Computer Science, Artificial Intelligence

Graph Convolutional Dictionary Selection With L2,p Norm for Video Summarization

Mingyang Ma, Shaohui Mei, Shuai Wan, Zhiyong Wang, Xian-Sheng Hua, David Dagan Feng

Summary: In this paper, a general framework called graph convolutional dictionary selection with L-2, L-p (0 < p <= 1) norm (GCDS(2,p)) is proposed for both keyframe selection and skimming based summarization in video summarization. The structured information in videos is taken into account by incorporating graph embedding into dictionary selection. L-2, L-p (0 < p <= 1) norm constrained row sparsity with flexible p values is used for selecting diverse and representative keyframes or key shots.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Engineering, Electrical & Electronic

Vision-Enhanced and Consensus-Aware Transformer for Image Captioning

Shan Cao, Gaoyun An, Zhenxing Zheng, Zhiyong Wang

Summary: In this paper, a Vision-enhanced and Consensus-aware Transformer (VCT) is proposed for image captioning. The model extends the self-attention module and introduces memory-based attention and visual perception modules to enhance visual representation of images. Consensus knowledge is learned through word correlation graph and graph convolutional network. Experimental results demonstrate state-of-the-art performance on two benchmark datasets.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Review Agronomy

Weed detection to weed recognition: reviewing 50 years of research to identify constraints and opportunities for large-scale cropping systems

Guy R. Y. Coleman, Asher Bender, Kun Hu, Shaun M. Sharpe, Arnold W. Schumann, Zhiyong Wang, Muthukumar V. Bagavathiannan, Nathan S. Boyd, Michael J. Walsh

Summary: Advances in weed recognition technologies over the past 50 years have provided the necessary performance for site-specific weed control in large-scale production systems. These technologies offer improved management of diverse weed morphology and enable the use of nonselective weed control options such as lasers and electrical weeding. Recent research has focused on computer vision techniques and deep convolutional neural network (CNN) approaches for weed recognition.

WEED TECHNOLOGY (2022)

Article Computer Science, Interdisciplinary Applications

Adversarial Evolving Neural Network for Longitudinal Knee Osteoarthritis Prediction

Kun Hu, Wenhua Wu, Wei Li, Milena Simic, Albert Zomaya, Zhiyong Wang

Summary: A novel deep learning architecture, A-ENN, is proposed for longitudinal grading of knee osteoarthritis (KOA) severity. By obtaining evolution traces through an adversarial training scheme, the fine-grained domain knowledge is fused with general convolutional image representations, achieving longitudinal grading.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2022)

Article Computer Science, Artificial Intelligence

A Sparse Framework for Robust Possibilistic K-Subspace Clustering

Shan Zeng, Xiangjun Duan, Hao Li, Jun Bai, Yuanyan Tang, Zhiyong Wang

Summary: In this article, a novel robust and sparse possibilistic K-subspace (RSPKS) clustering algorithm is proposed to handle clustering of noisy, high-dimensional, and structurally complex data. The algorithm integrates subspace recovery and possibilistic clustering algorithms under a unified sparse framework to effectively deal with the adverse impact of noisy samples and complex data structures. Experimental results on both synthetic and real-world datasets demonstrate that the proposed method outperforms state-of-the-art algorithms in terms of clustering accuracy.

IEEE TRANSACTIONS ON FUZZY SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Graph Fusion Network-Based Multimodal Learning for Freezing of Gait Detection

Kun Hu, Zhiyong Wang, Kaylena A. Ehgoetz Martens, Markus Hagenbuchner, Mohammed Bennamoun, Ah Chung Tsoi, Simon J. G. Lewis

Summary: This study proposes a multimodal learning-based FoG detection method using a graph fusion neural network (GFN) that combines footstep pressure maps and video recordings. The GFN constructs multimodal graphs to reduce redundancy among different modalities and achieves superior performance. Experimental results show promising FoG detection with an AUC of 0.882.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Interdisciplinary Applications

Cascade Multi-Level Transformer Network for Surgical Workflow Analysis

Wenxi Yue, Hongen Liao, Yong Xia, Vincent Lam, Jiebo Luo, Zhiyong Wang

Summary: This paper proposes a Cascade Multi-Level Transformer Network (CMTNet) for recognizing surgical phases, and introduces the Adaptive Multi-Level Context Aggregation (AMCA) modules. Through the gradual enrichment of multi-level semantics and the refinement of key context, CMTNet achieves more accurate phase prediction.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2023)

Article Computer Science, Information Systems

Multi-Level Adversarial Spatio-Temporal Learning for Footstep Pressure Based FoG Detection

Kun Hu, Shaohui Mei, Wei Wang, Kaylena A. Ehgoetz Martens, Liang Wang, Simon J. G. Lewis, David D. Feng, Zhiyong Wang

Summary: Freezing of gait (FoG) is a common symptom of Parkinson's disease, and a computer-aided detection and quantification tool for FoG is important for improving treatment quality. Footstep pressure sequences obtained from pressure sensitive gait mats provide a non-invasive way to evaluate FoG, and the proposed Adversarial Spatio-temporal Network (ASTN) is a novel deep learning architecture that can learn FoG patterns and achieve robust detection. In experiments, ASTN outperformed conventional learning methods with an AUC of 0.85.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Confidence-Calibrated Face Image Forgery Detection with Contrastive Representation Distillation

Puning Yang, Huaibo Huang, Zhiyong Wang, Aijing Yu, Ran He

Summary: In this paper, a novel contrastive distillation calibration (CDC) framework is proposed to address the issue of model generalization in face forgery detection. The framework distills contrastive representations with confidence calibration. A dual-teacher module is devised to separately learn knowledge for each forgery type, and a contrastive representation learning strategy is presented to enhance diverse forgery artifacts. Moreover, label smoothing is introduced to calibrate the model confidence with the target outputs.

COMPUTER VISION - ACCV 2022, PT IV (2023)

Article Chemistry, Physical

Asymmetric alkyl-alkyl cross-coupling enabled by earth-abundant metal-catalyzed hydroalkylations of olefins

Peng-Fei Yang, Wei Shu

Summary: Stereogenic carbon centers with C(sp3)-C(sp3) bonds are widely present in natural products, bioactive targets, and chiral organic materials. Transition-metal-catalyzed C(sp3)-C(sp3) bond-forming processes offer a promising solution to generate such stereogenic centers. Recent progress in the in situ formation of alkyl metallic reagents enabled by hydrometallation of olefins for asymmetric alkyl-alkyl cross-coupling is highlighted. Mechanistic considerations, challenges, and future efforts in asymmetric hydroalkylation of olefins are also discussed.

CHEM CATALYSIS (2023)

Article Computer Science, Artificial Intelligence

Higher Order Polynomial Transformer for Fine-Grained Freezing of Gait Detection

Renfei Sun, Kun Hu, Kaylena A. Ehgoetz Martens, Markus Hagenbuchner, Ah Chung Tsoi, Mohammed Bennamoun, Simon J. G. Lewis, Zhiyong Wang

Summary: Freezing of Gait (FoG) is a common symptom of Parkinson's disease and machine learning-based methods can effectively detect it. This article proposes a novel deep learning architecture called higher order polynomial transformer (HP-Transformer) for fine-grained FoG detection based on vision inputs. The proposed method incorporates pose and appearance feature sequences and achieves an AUC of 0.92 for FoG detection.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Action Recognition With Motion Diversification and Dynamic Selection

Peiqin Zhuang, Yu Guo, Zhipeng Yu, Luping Zhou, Lei Bai, Ding Liang, Zhiyong Wang, Yali Wang, Wanli Ouyang

Summary: Motion modeling plays a crucial role in modern action recognition methods. However, variations in motion dynamics across different video clips present a challenge in adaptively covering proper motion information. In this paper, we propose a Motion Diversification and Selection (MoDS) module that generates diversified spatio-temporal motion features and dynamically selects the suitable motion representation for categorizing input videos. Our method achieves state-of-the-art performance on benchmarks with large motion variations.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

Affective Audio Annotation of Public Speeches with Convolutional Clustering Neural Network

Jiahao Xu, Boyan Zhang, Zhiyong Wang, Yang Wang, Fang Chen, Junbin Gao, David Dagan Feng

Summary: Public speaking is a crucial skill in daily communication. The lack of personalized feedback hinders the improvement of this skill, even with more practice. This research proposes a novel convolutional clustering neural network (CCNN) to solve the problem of personalized feedback by learning from online public speech videos. Experimental results on a self-built affective audio annotation dataset show that our proposed method outperforms traditional CNN-based approaches, achieving better affective annotation with a lower hamming loss.

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING (2022)

Article Computer Science, Artificial Intelligence

Exploiting sublimated deep features for image retrieval

Guang-Hai Liu, Zuo-Yong Li, Jing-Yu Yang, David Zhang

Summary: This article introduces a novel image retrieval method that improves retrieval performance by using sublimated deep features. The method incorporates orientation-selective features and color perceptual features, effectively mimicking these mechanisms to provide a more discriminating representation.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Region-adaptive and context-complementary cross modulation for RGB-T semantic segmentation

Fengguang Peng, Zihan Ding, Ziming Chen, Gang Wang, Tianrui Hui, Si Liu, Hang Shi

Summary: RGB-Thermal (RGB-T) semantic segmentation is an emerging task that aims to improve the robustness of segmentation methods under extreme imaging conditions by using thermal infrared modality. The challenges of foreground-background distinguishment and complementary information mining are addressed by proposing a cross modulation process with two collaborative components. Experimental results show that the proposed method achieves state-of-the-art performances on current RGB-T segmentation benchmarks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

F-SCP: An automatic prompt generation method for specific classes based on visual language pre-training models

Baihong Han, Xiaoyan Jiang, Zhijun Fang, Hamido Fujita, Yongbin Gao

Summary: This paper proposes a novel automatic prompt generation method called F-SCP, which focuses on generating accurate prompts for low-accuracy classes and similar classes. Experimental results show that our approach outperforms state-of-the-art methods on six multi-domain datasets.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Residual Deformable Convolution for better image de-weathering

Huikai Liu, Ao Zhang, Wenqian Zhu, Bin Fu, Bingjian Ding, Shengwu Xiong

Summary: Adverse weather conditions present challenges for computer vision tasks, and image de-weathering is an important component of image restoration. This paper proposes a multi-patch skip-forward structure and a Residual Deformable Convolutional module to improve feature extraction and pixel-wise reconstruction.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

A linear transportation LP distance for pattern recognition

Oliver M. Crook, Mihai Cucuringu, Tim Hurst, Carola-Bibiane Schonlieb, Matthew Thorpe, Konstantinos C. Zygalakis

Summary: The transportation LP distance (TLP) is a generalization of the Wasserstein WP distance that can be applied directly to color or multi-channelled images, as well as multivariate time-series. TLP interprets signals as functions, while WP interprets signals as measures. Although both distances are powerful tools in modeling data with spatial or temporal perturbations, their computational cost can be prohibitively high for moderate pattern recognition tasks. The linear Wasserstein distance offers a method for projecting signals into a Euclidean space, and in this study, we propose linear versions of the TLP distance (LTLP) that show significant improvement over the linear WP distance in signal processing tasks while being several orders of magnitude faster to compute than the TLP distance.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Learning a target-dependent classifier for cross-domain semantic segmentation: Fine-tuning versus meta-learning

Haitao Tian, Shiru Qu, Pierre Payeur

Summary: This paper proposes a method of target-dependent classifier, which optimizes the joint hypothesis of domain adaptation into a target-dependent hypothesis that better fits with the target domain clusters through an unsupervised fine-tuning strategy and the concept of meta-learning. Experimental results demonstrate that this method outperforms existing techniques in synthetic-to-real adaptation and cross-city adaptation benchmarks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

KGSR: A kernel guided network for real-world blind super-resolution

Qingsen Yan, Axi Niu, Chaoqun Wang, Wei Dong, Marcin Wozniak, Yanning Zhang

Summary: Deep learning-based methods have achieved remarkable results in the field of super-resolution. However, the limitation of paired training image sets has led researchers to explore self-supervised learning. However, the assumption of inaccurate downscaling kernel functions often leads to degraded results. To address this issue, this paper introduces KGSR, a kernel-guided network that trains both upscaling and downscaling networks to generate high-quality high-resolution images even without knowing the actual downscaling process.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Gait feature learning via spatio-temporal two-branch networks

Yifan Chen, Xuelong Li

Summary: Gait recognition is a popular technology for identification due to its ability to capture gait features over long distances without cooperation. However, current methods face challenges as they use a single network to extract both temporal and spatial features. To solve this problem, we propose a two-branch network that focuses on spatial and temporal feature extraction separately. By combining these features, we can effectively learn the spatio-temporal information of gait sequences.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

PAMI: Partition Input and Aggregate Outputs for Model Interpretation

Wei Shi, Wentao Zhang, Wei-shi Zheng, Ruixuan Wang

Summary: This article proposes a simple yet effective visualization framework called PAMI, which does not require detailed model structure and parameters to obtain visualization results. It can be applied to various prediction tasks with different model backbones and input formats.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Disturbance rejection with compensation on features

Xiaobo Hu, Jianbo Su, Jun Zhang

Summary: This paper reviews the latest technologies in pattern recognition, highlighting their instabilities and failures in practical applications. From a control perspective, the significance of disturbance rejection in pattern recognition is discussed, and the existing problems are summarized. Finally, potential solutions related to the application of compensation on features are discussed to emphasize future research directions.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

ECLAD: Extracting Concepts with Local Aggregated Descriptors

Andres Felipe Posada-Moreno, Nikita Surya, Sebastian Trimpe

Summary: Convolutional neural networks are widely used in critical systems, and explainable artificial intelligence has proposed methods for generating high-level explanations. However, these methods lack the ability to determine the location of concepts. To address this, we propose a novel method for automatic concept extraction and localization based on pixel-wise aggregations, and validate it using synthetic datasets.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Dynamic Graph Contrastive Learning via Maximize Temporal Consistency

Peng Bao, Jianian Li, Rong Yan, Zhongyi Liu

Summary: In this paper, a novel Dynamic Graph Contrastive Learning framework, DyGCL, is proposed to capture the temporal consistency in dynamic graphs and achieve good performance in node representation learning.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

ConvGeN: A convex space learning approach for deep-generative oversampling and imbalanced classification of small tabular datasets

Kristian Schultz, Saptarshi Bej, Waldemar Hahn, Markus Wolfien, Prashant Srivastava, Olaf Wolkenhauer

Summary: Research indicates that deep generative models perform poorly compared to linear interpolation-based methods for synthetic data generation on small, imbalanced tabular datasets. To address this, a new approach called ConvGeN, combining convex space learning with deep generative models, has been proposed. ConvGeN improves imbalanced classification on small datasets while remaining competitive with existing linear interpolation methods.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

H-CapsNet: A capsule network for hierarchical image classification

Khondaker Tasrif Noor, Antonio Robles-Kelly

Summary: In this paper, the authors propose H-CapsNet, a capsule network designed for hierarchical image classification. The network effectively captures hierarchical relationships using dedicated capsules for each class hierarchy. A modified hinge loss is utilized to enforce consistency among the involved hierarchies. Additionally, a strategy for dynamically adjusting training parameters is presented to achieve better balance between the class hierarchies. Experimental results demonstrate that H-CapsNet outperforms competing hierarchical classification networks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

CS-net: Conv-simpleformer network for agricultural image segmentation

Lei Liu, Guorun Li, Yuefeng Du, Xiaoyu Li, Xiuheng Wu, Zhi Qiao, Tianyi Wang

Summary: This study proposes a new agricultural image segmentation model called CS-Net, which uses Simple-Attention Block and Simpleformer to improve accuracy and inference speed, and addresses the issue of performance collapse of Transformers in agricultural image processing.

PATTERN RECOGNITION (2024)