☆ 4.6 Article

Adaptive road detection via context-aware label transfer

NEUROCOMPUTING (2015)

Journal

NEUROCOMPUTING

Volume 158, Issue -, Pages 174-183

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2015.01.054

Keywords

Computer vision; Road detection; Depth map; Label transfer; Context-aware; MRF

Categories

Computer Science, Artificial Intelligence

Funding

State Key Program of National Natural Science of China [61232010]
National Natural Science Foundation of China [61172143, 61379094, 61105012]
Fundamental Research Funds for the Central Universities [3102014JC02020G07]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

The vision ability is fundamentally important for a mobile robot. Many aspects have been investigated during the past few years, but there still remain questions to be answered. This work mainly focuses on the task of road detection, which is considered as the first step for a robot to become moveable. The proposed method combines the depth clue with traditional RGB information and is divided into three steps: depth recovery and superpixel generation, weakly supervised SVM classification and context-aware label transfer. The main contributions made in this paper are (I) Design a novel superpixel based context-aware descriptor by utilizing depth map. (2) Conduct label transfer in an efficient nearest neighbor search and a temporal MRF model. (3) Update the learned model adaptively with the changing scene. Experimental results on a publicly available dataset justify the effectiveness of the proposed method. (C) 2015 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper

Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6

Not enough ratings

Secondary Ratings

Novelty

-

Significance

-

Scientific rigor

-

Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

In Pursuit of Beauty: Aesthetic-Aware and Context-Adaptive Photo Selection in Crowdsensing

Tongqing Zhou, Zhiping Cai, Fang Liu, Jinshu Su

Summary: Propose a novel photo selection framework with adaptive aesthetic awareness for crowdsensing, which actively learns contextual knowledge for dynamically tailoring the aesthetic predictor. Extensive experiments demonstrate the performance superiority of CrowdPicker over baselines and sampling strategies.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Add to Collection

Article Robotics

Perspective Aware Road Obstacle Detection

Krzysztof Lis, Sina Honari, Pascal Fua, Mathieu Salzmann

Summary: This study combines road obstacle detection techniques with perspective information to address the issue of diminishing apparent size of obstacles as their distance from the vehicle increases. The results demonstrate that the combination of these two strategies significantly improves obstacle detection performance and outperforms existing methods in terms of instance-level obstacle detection.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Add to Collection

Article Robotics

Diversity-Aware Label Distribution Learning for Microscopy Auto Focusing

Chuyan Zhang, Yun Gu, Jie Yang, Guang-Zhong Yang

Summary: The proposed diversity-aware learning framework predicts the optimal focus position based on a single image, utilizing a two-point representation of distance for label distribution learning and an intra-class discrepancy penalty term to reduce pathology slide variation. Experiments show promising results in accuracy, real-time performance, and generalization, outperforming previous no-reference approaches by 39%.

IEEE ROBOTICS AND AUTOMATION LETTERS (2021)

Add to Collection

Article Computer Science, Artificial Intelligence

Spatially-Aware Context Neural Networks

Dongsheng Ruan, Yu Shi, Jun Wen, Nenggan Zheng, Min Zheng

Summary: This study introduces a novel SAC block for learning spatially-aware contexts to improve long-range dependency modeling, with extensive experiments showing significant performance improvements in various computer vision tasks.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Add to Collection

Article Computer Science, Artificial Intelligence

Context-Aware Path Ranking in Road Networks

Sean Bin Yang, Chenjuan Guo, Bin Yang

Summary: Ranking paths in transportation services is an important functionality, and this study proposes a regression modeling approach to assign ranking scores to paths based on historical trajectories. The study introduces effective training data enrichment and a multi-task learning framework to improve the ranking estimation. Empirical studies validate the effectiveness and practicality of the proposed framework.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Add to Collection

Article Robotics

Striving for Less: Minimally-Supervised Pseudo-Label Generation for Monocular Road Segmentation

Francois Robinet, Yussef Akl, Kaleem Ullah, Farzad Nozarian, Christian Mueller, Raphael Frank

Summary: Identifying traversable space is a crucial problem in autonomous robot navigation, and recent research has focused on unsupervised and semi-supervised approaches to reduce annotation costs. This study proposes a practical and minimally-supervised method for monocular road segmentation, utilizing task-specific feature extraction and pseudo-labeling. The results show that even minimal labeling efforts can greatly improve the performance, demonstrating the pragmatic approach to labeling.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Add to Collection

Article Computer Science, Information Systems

A topography-aware approach to the automatic generation of urban road networks

Zhou Fang, Jiaxin Qi, Lubin Fan, Jianqiang Huang, Ying Jin, Tianren Yang

Summary: Existing deep-learning tools for road network generation have limitations in flat urban areas. This paper proposes a new method that combines geometric configurations and topographic features to automate street network generation in both flat and hilly urban areas. The improved model shows more realistic predictions and is more effective in generating streets in hilly areas. The geo-extractor module provides insights in recognizing and considering topographic information.

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE (2022)

Add to Collection

Article Automation & Control Systems

Confidence-Aware Paced-Curriculum Learning by Label Smoothing for Surgical Scene Understanding

Mengya Xu, Mobarakol Islam, Ben Glocker, Hongliang Ren

Summary: Curriculum learning and self-paced learning are effective training strategies in robotic vision. Most existing works focus on designing curricula based on difficulty levels, but the approach of smoothing labels for learning control is unexplored. In this work, a paced curriculum by label smoothing (P-CBLS) is proposed for classification and semantic segmentation tasks. The proposed method improves prediction accuracy and robustness.

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING (2023)

Add to Collection

Article Computer Science, Information Systems

Relevant Visual Semantic Context-Aware Attention-Based Dialog

Eugene Tan Boon Hong, Yung-Wey Chong, Tat-Chee Wan, Kok-Lim Alvin Yau

Summary: This paper proposes a new visual dialog dataset called DS-Dialog to address the challenges in obtaining sufficient context and overcoming visual semantic limitations faced by the existing dataset. DS-Dialog enhances the current dataset by adding new context-aware relevant history to provide more visual semantic context for each image. The proposed DS-Dialog model achieves better performance compared to previous models, demonstrating the importance of relevant semantic historical context in enhancing the visual semantic relationship between textual and visual representations.

CMC-COMPUTERS MATERIALS & CONTINUA (2023)

Add to Collection

Article Engineering, Civil

Using Vision Transformers for Spatial-Context-Aware Rain and Road Surface Condition Detection on Freeways

Amr Abdelraouf, Mohamed Abdel-Aty, Yina Wu

Summary: This paper proposes a novel vision-based method to detect rain and road surface conditions using roadside traffic cameras. By utilizing vision transformers for image-based classification and leveraging the geographical distribution of roadside cameras and a spatial self-attention network, the detection model's accuracy is enhanced.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2022)

Add to Collection

Article Automation & Control Systems

Vision Based Hand Gesture Recognition Using 3D Shape Context

Chen Zhu, Jianyu Yang, Zhanpeng Shao, Chunping Liu

Summary: The paper introduces a new method for hand gesture recognition using depth maps and 3D shape context descriptors. Experimental results show that the proposed method is robust to noise, articulated variations, and rigid transformations, outperforming current state-of-the-art methods in accuracy and efficiency.

IEEE-CAA JOURNAL OF AUTOMATICA SINICA (2021)

Add to Collection

Article Chemistry, Analytical

Attention-Based Context Aware Network for Semantic Comprehension of Aerial Scenery

Weipeng Shi, Wenhu Qin, Zhonghua Yun, Peng Ping, Kaiyang Wu, Yuke Qu

Summary: This study introduces an end-to-end semantic segmentation model for aerial images based on HRNET, addressing two challenges in RSIs semantic segmentation with the incorporation of CRAM and CCFM modules. Experimental results show that the model improves accuracy and outperforms some commonly used CNN architectures.

SENSORS (2021)

Add to Collection

Article Computer Science, Information Systems

Quality-Aware Network for Human Parsing

Lu Yang, Qing Song, Zhihui Wang, Zhiwei Liu, Songcen Xu, Zhihao Li

Summary: This study proposes a statistical method based on the output probability map to estimate the quality of network output for human parsing. It introduces a Quality-Aware Module (QAM) and a Quality-Aware Network (QANet) to improve the quality of human parsing results. The method achieves the best performance on multiple benchmarks and has the potential for application in other tasks.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Monocular Road Planar Parallax Estimation

Haobo Yuan, Teng Chen, Wei Sui, Jiafeng Xie, Lefei Zhang, Yuan Li, Qian Zhang

Summary: This paper proposes a novel deep neural network called RPANet for 3D sensing from monocular image sequences based on planar parallax. RPANet takes a pair of images aligned by the homography of the road plane as input and outputs a height-to-depth ratio map for 3D reconstruction. The map can be combined with the road plane as a reference to estimate the 3D structure by warping the consecutive frames. The effectiveness of the method is verified through comprehensive experiments on the Waymo Open Dataset.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

Spatial Context-Aware Object-Attentional Network for Multi-Label Image Classification

Jialu Zhang, Jianfeng Ren, Qian Zhang, Jiang Liu, Xudong Jiang

Summary: In this paper, a multi-branch deep neural network is proposed for multi-label image classification. It effectively utilizes label-related semantic information, background context, and spatial semantic information to better detect target objects. Experimental results show that the proposed method outperforms the state-of-the-art methods for multi-label image classification.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Add to Collection

Article Geochemistry & Geophysics

Edge Neighborhood Contrastive Learning for Building Change Detection

Mingwei Zhang, Qiang Li, Yuan Yuan, Qi Wang

Summary: This article proposes a building change detection method based on deep learning, which effectively addresses the challenges of temporal-spatial correlation and discrimination in the neighborhood of the edge by introducing a selective attention module and a contrastive learning method. The experimental results show that the proposed method achieves competitive performance in terms of objective metrics and visual comparisons.

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2023)

Add to Collection

Article Engineering, Electrical & Electronic

Boosting One-Stage License Plate Detector via Self-Constrained Contrastive Aggregation

Haoxuan Ding, Junyu Gao, Yuan Yuan, Qi Wang

Summary: Scene Text Detection (STD) has been successfully applied in various fields. One important application is License Plate Detection (LPD). License Plate (LP) serves as a unique identifier for vehicles, facilitating intelligent transportation in areas such as traffic enforcement and dispatching. However, similar scene texts often lead to misjudgment by LP detectors. To address this issue, more discriminative features are required. In this study, we propose a Self-Constrained Contrastive Aggregation (SCCA) method to aggregate features in the latent space and improve the feature expression of the backbone. Experimental results demonstrate that SCCA significantly improves the baseline performance and outperforms recent LP detectors, achieving a 99.7 F1-score and AP on the UFPR-ALPR dataset. Additionally, a comparison between self-constrained contrastive learning and vanilla contrastive learning confirms the superiority of SCCA and the reasonability of our assumptions.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Add to Collection

Article Geochemistry & Geophysics

CoF-Net: A Progressive Coarse-to-Fine Framework for Object Detection in Remote-Sensing Imagery

Cong Zhang, Kin-Man Lam, Qi Wang

Summary: In this article, a novel coarse-to-fine framework (CoF-Net) is proposed for object detection in remote-sensing imagery, which aims to improve the performance of existing object detectors by progressively enhancing feature representation and selecting stronger training samples. CoF-Net smoothly refines the original coarse features into multispectral nonlocal fine features with discriminative spatial-spectral details and semantic relations, and dynamically considers samples from coarse to fine during training by introducing geometric and classification constraints. Comprehensive experiments demonstrate the effectiveness and superiority of the proposed method.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

LSV-LP: Large-Scale Video-Based License Plate Detection and Recognition

Qi Wang, Xiaocheng Lu, Cong Zhang, Yuan Yuan, Xuelong Li

Summary: This paper constructs a large-scale video-based license plate dataset named LSV-LP, and proposes a new framework called MFLPR-Net to improve the performance of license plate detection and recognition systems.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Add to Collection

Article Geography, Physical

Semi-supervised bidirectional alignment for Remote Sensing cross-domain scene classification

Wei Huang, Yilei Shi, Zhitong Xiong, Qi Wang, Xiao Xiang Zhu

Summary: RS image scene classification has gained attention for its applications. Conventional supervised approaches require labeled data, but with more RS images available, utilizing unlabeled data becomes urgent. This paper proposes a SSDA method called BSCA for RS cross-domain scene classification, using unsupervised and supervised alignment strategies to reduce domain shift.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2023)

Add to Collection

Article Geochemistry & Geophysics

LGNet: Location-Guided Network for Road Extraction From Satellite Images

Jingtao Hu, Junyu Gao, Yuan Yuan, Jocelyn Chanussot, Qi Wang

Summary: The study proposes a location-guided network (LGNet) to improve connectivity performance in road extraction. By adding an auxiliary road location prediction (RLP) task, LGNet obtains global road connectivity information and enhances road segmentation performance. The features are guided by the global location context using a location-guided decoder (LG-Decoder) to capture the connectivity of each road segment.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Engineering, Civil

An End-to-End Contrastive License Plate Detector

Haoxuan Ding, Junyu Gao, Yuan Yuan, Qi Wang

Summary: License Plate (LP) is crucial for intelligent transportation, and a contrastive learning method is proposed to improve the detection accuracy. The Contrastive License Plate Detector (CLPD) employs a contrastive triad to decouple the foregrounds and backgrounds, and a contrastive learning branch is introduced to enhance the feature expression ability and extract discriminative features. The CLPD achieves significant improvements compared to baselines and other license plate detectors on multiple datasets.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Add to Collection

Article Geochemistry & Geophysics

Efficient Inductive Vision Transformer for Oriented Object Detection in Remote Sensing Imagery

Cong Zhang, Jingran Su, Yakun Ju, Kin-Man Lam, Qi Wang

Summary: A novel efficient inductive vision Transformer framework is proposed for oriented object detection in remote sensing imagery. The framework fully explores spatial redundancy and utilizes an adaptive multigrained routing mechanism to reduce computational cost without compromising accuracy. It also incorporates a compact dual-path encoding architecture and an angle tokenization technique to enhance inductive bias and promote the encoding and learning of directional knowledge. Comprehensive experiments demonstrate the effectiveness and superiority of the proposed framework for oriented object detection in remote sensing images.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Geochemistry & Geophysics

Uncertainty-Aware Graph Reasoning With Global Collaborative Learning for Remote Sensing Salient Object Detection

Yanfeng Liu, Yuan Yuan, Qi Wang

Summary: In this paper, a hybrid modeling approach called the uncertainty-aware graph reasoning with global collaborative learning (UG2L) framework is proposed to address the accurate detection problem of salient objects in optical remote sensing images (RSIs) with complex edges and irregular topology. The proposed method models the intricate relations among RSI patches using graph representations. Additionally, a global context block with a linear attention mechanism is introduced to explore the multiscale and global context collaboratively, and an uncertainty-aware loss is designed to enhance the model's reliability for better saliency prediction.

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2023)

Add to Collection

Article Geochemistry & Geophysics

Transcending Pixels: Boosting Saliency Detection via Scene Understanding From Aerial Imagery

Yanfeng Liu, Zhitong Xiong, Yuan Yuan, Qi Wang

Summary: Existing remote sensing image salient object detection methods often focus on pixel-level supervision and overlook image-level scene information. In this study, we annotate image-level scene labels for three datasets and propose a scene-guided dual-branch network (SDNet) that performs cross-task knowledge distillation to improve saliency detection accuracy.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Geochemistry & Geophysics

Exploring Hard Samples in Multiview for Few-Shot Remote Sensing Scene Classification

Yuyu Jia, Junyu Gao, Wei Huang, Yuan Yuan, Qi Wang

Summary: This paper proposes a multiview integration network (HSL-MINet) to tackle the few-shot learning problem in remote sensing scene classification. The network enhances the model's generalization ability and discriminative power of the decision boundary through a multiview integration module and a hard sample learning module. Extensive experiments on multiple datasets demonstrate that HSL-MINet outperforms previous state-of-the-art methods.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Geochemistry & Geophysics

Distilling Knowledge From Super-Resolution for Efficient Remote Sensing Salient Object Detection

Yanfeng Liu, Zhitong Xiong, Yuan Yuan, Qi Wang

Summary: This study proposes a universal super-resolution assisted learning (SRAL) framework to improve the performance and efficiency of existing remote sensing salient object detectors. The framework includes a transposed saliency detection decoder (TSDD), an auxiliary SR decoder (ASRD), and a task-fusion guidance module (TFGM). Experimental results on three datasets demonstrate that SRAL outperforms more than 20 algorithms and can be applied to existing networks.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Geochemistry & Geophysics

Holistic Mutual Representation Enhancement for Few-Shot Remote Sensing Segmentation

Yuyu Jia, Junyu Gao, Wei Huang, Yuan Yuan, Qi Wang

Summary: This article proposes a holistic mutual representation enhancement method for few-shot segmentation, addressing the issues of intra-class variations and background interference. Extensive experiments demonstrate the superiority of the proposed method and a corresponding benchmark dataset is created.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Engineering, Electrical & Electronic

Boosting Object Detectors via Strong-Classification Weak-Localization Pretraining in Remote Sensing Imagery

Cong Zhang, Tianshan Liu, Jun Xiao, Kin-Man Lam, Qi Wang

Summary: This article proposes a novel pretraining paradigm specifically for RS object detection, which significantly improves detection performance and outperforms traditional classification pretraining methods. The method generates pseudo bounding boxes on a reconstructed RS classification-style dataset and integrates them with accurate class labels as location- and category-related supervisions for pretraining the RS detectors.

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT (2023)

Add to Collection

Article Geochemistry & Geophysics

Multiscale Factor Joint Learning for Hyperspectral Image Super-Resolution

Qiang Li, Yuan Yuan, Qi Wang

Summary: This paper proposes a multiscale factor joint learning method for hyperspectral image super-resolution, which effectively explores the interdependence among different scale factors and optimizes the representation of spatial and spectral contents through information interaction and feedback correction, achieving superior performance.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Add to Collection

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Add to Collection

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Add to Collection

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)

Add to Collection

© Peeref 2019-2024. All rights reserved.