Article

Understanding human intention by connecting perception and action learning in artificial agents

NEURAL NETWORKS (2017)

Journal

NEURAL NETWORKS

Volume 92, Pages 29-38

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.neunet.2017.01.009

Keywords

Object-Augmented Multiple Timescale Recurrent Neural Network; Perception-action connected learning; Intention understanding; Human-robot interaction; Cognitive agent; Affordance

Categories

Computer Science, Artificial Intelligence; Neurosciences

Funding

Industrial Strategic Technology Development Program - Ministry of Trade, Industry and Energy (MOTIE, Korea) [10044009]

Abstract

To develop an advanced human-robot interaction system, it is important to first understand how human beings learn to perceive, think, and act in an ever-changing world. In this paper, we propose an intention understanding system that uses an Object Augmented-Supervised Multiple Timescale Recurrent Neural Network (OA-SMTRNN) and demonstrate the effects of perception-action connected learning in an artificial agent, which is inspired by psychological and neurological phenomena in humans. We believe that action and perception are not isolated processes in human mental development, and argue that these psychological and neurological interactions can be replicated in a human-machine scenario. The proposed OA-SMTRNN consists of perception and action modules and their connection, which are constructed of supervised multiple timescale recurrent neural networks and the deep auto-encoder, respectively, and connects their perception and action for understanding human intention. Our experimental results show the effects of perception-action connected learning, and demonstrate that robots can understand human intention with OA-SMTRNN through perception-action connected learning. (C) 2017 Elsevier Ltd. All rights reserved.

Recommended

Article Robotics

A Scalable Approach to Predict Multi-Agent Motion for Human-Robot Collaboration

Mohammad Samin Yasar, Tariq Iqbal

Summary: A novel sequence learning approach is proposed in this work, which learns a robust representation of human motion and predicts intent, working for various settings and improving human motion understanding.

IEEE ROBOTICS AND AUTOMATION LETTERS (2021)

Article Geology

Geological symbol recognition on geological map using convolutional recurrent neural network with augmented data

Qinjun Qiu, Yongjian Tan, Kai Ma, Miao Tian, Zhong Xie, Liufeng Tao

Summary: Geological maps contain important geological knowledge and accurately recognizing symbols in these maps is crucial for understanding and analyzing them. This paper proposes a three-stage framework based on deep learning to automatically recognize symbols in geological maps, including dataset construction, CRNN model training, and geo-symbol index construction.

ORE GEOLOGY REVIEWS (2023)

Article Computer Science, Artificial Intelligence

Task-Oriented Robot Cognitive Manipulation Planning Using Affordance Segmentation and Logic Reasoning

Zhongli Wang, Guohui Tian

Summary: This article proposes a task-oriented robot cognitive manipulation planning method using affordance segmentation and logic reasoning, which can provide robots with semantic reasoning skills to manipulate appropriate parts of an object according to different tasks. The method utilizes a convolutional neural network based on the attention mechanism to obtain object affordance and constructs object/task ontologies for the management of objects and tasks. By establishing object-task affordances through causal probability logic, the method can reason manipulation regions' configuration for the intended task with the help of the Dempster-Shafer theory. Experimental results demonstrate that this method effectively improves the cognitive manipulation ability of robots and enhances their performance in various tasks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Information Systems

Slim-FCP: Lightweight-Feature-Based Cooperative Perception for Connected Automated Vehicles

Jingda Guo, Dominic Carrillo, Qi Chen, Qing Yang, Song Fu, Hongsheng Lu, Rui Guo

Summary: Cooperative perception is a novel approach to improve driving safety by overcoming the sensing limitation of a single automated vehicle. Existing solutions for cooperative perception use feature maps generated by CNN models, but their large size makes transmission difficult. This study proposes Slim-FCP, a new method that significantly reduces the transmission data size by using a channelwise feature encoder to remove irrelevant features and an intelligent channel selection strategy. Evaluation results show that Slim-FCP reduces transmission data size by 75% compared to the best state-of-the-art solution, with minimal loss in object detection recall.

IEEE INTERNET OF THINGS JOURNAL (2022)

Article Computer Science, Artificial Intelligence

Lower limb movement intention recognition for rehabilitation robot aided with projected recurrent neural network

Mei Liu, Bo Peng, Mingsheng Shang

Summary: Research on intention recognition of lower limb rehabilitation robot requires consideration of normal movement intentions before improving models to recognize patient intentions. A projected recurrent neural network (PRNN) model has been proposed to address convergence speed limitations of traditional RNN models, showing successful application in experimental intention recognition of lower limb movements.

COMPLEX & INTELLIGENT SYSTEMS (2022)

Article Computer Science, Information Systems

The Collaborative Interaction with Pokemon-Go Robot uses Augmented Reality technology for Increasing the Intentions of Patronizing Hospitality

Hsiu-Yuan Wang, Jian-Hong Wang, Jie Zhang, Hsing-Wei Tai

Summary: The aim of this study is to investigate the factors influencing Pokemon-Go robot users' intention to patronize hospitality firms using virtual monsters.

INFORMATION SYSTEMS FRONTIERS (2021)

Article Transportation

Infrastructure sensor-based cooperative perception for early stage connected and automated vehicle deployment

Chenxi Chen, Qing Tang, Xianbiao Hu, Zhitong Huang

Summary: Infrastructure-based sensors are a potential solution to support the adoption of connected and automated vehicle technologies in the early stages. These sensors can significantly enhance the driving context understanding of connected vehicles with lower levels of automation and overcome occlusion and limited sensor range issues. This manuscript proposes a cooperative perception modeling framework that addresses the key technical challenge of time delay in the perception process, using a CTRV model, delay compensation and fusion module, and an UKF algorithm for improved object tracking accuracy considering communication time delay. Simulation experiments show satisfactory results.

JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Pediatrics

Robotic Anxiety-Parents' Perception of Robot-Assisted Pediatric Surgery

Elisabeth Ammer, Laura Sophie Mandt, Isabelle Christine Silbersdorff, Fritz Kahl, York Hagmayer

Summary: Compared to other countries, robot-assisted pediatric surgery is not widely practiced in Germany. This study analyzed parents' intention to choose robot-assisted or laparoscopic surgery for their children, finding that parents were more inclined towards laparoscopic surgery. The perception of more benefits, assumed positive attitude from the social environment, and reduced anxiety increased the intention. The type of surgery influenced intentions through the assumed attitude of the social environment.

CHILDREN-BASEL (2022)

Article Chemistry, Analytical

Object-of-Interest Perception in a Reconfigurable Rolling-Crawling Robot

Archana Semwal, Melvin Ming Jun Lee, Daniela Sanchez, Sui Leng Teo, Bo Wang, Rajesh Elara Mohan

Summary: This article presents a method for flexible robotic development based on Cebrenus Rechenburgi, which determines appropriate locomotion modes through object-of-interest perception. The authors trained a locomotion mode recognition framework with a self-collected dataset and validated its effectiveness and accuracy through experiments. The results show that the framework can successfully determine the robot's locomotion modes during complex pathways.

SENSORS (2022)

Article Computer Science, Interdisciplinary Applications

A mixed perception-based human-robot collaborative maintenance approach driven by augmented reality and online deep reinforcement learning

Changchun Liu, Zequn Zhang, Dunbing Tang, Qingwei Nie, Linqi Zhang, Jiaye Song

Summary: This paper proposes a mixed perception-based human-robot collaborative maintenance approach with three-hierarchy structures to address the problems in human-robot collaboration maintenance. The approach includes a perception module for recognizing human safety and maintenance request, a decision-making module for executing robotized maintenance tasks, and an augmented reality-assisted interaction interface for personnel. Comparative experiments in a machining workshop demonstrate the competitive performance of the proposed approach compared with other state-of-the-art methods.

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING (2023)

Article Automation & Control Systems

Hybrid Recurrent Neural Network Architecture-Based Intention Recognition for Human-Robot Collaboration

Xiaoshan Gao, Liang Yan, Gang Wang, Chris Gerada

Summary: This study proposes a hybrid recurrent neural network architecture for intention recognition in collaborative assembly tasks. By improving the activation functions of LSTM and Bi-LSTM networks, and utilizing them in the hybrid architecture, the prediction performance of intention recognition can be effectively improved.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Construction & Building Technology

Effects of Luminance Contrast on Depth Perception in Optical See-Through Augmented Reality

Jingyuan Wang, Shining Ma, Yue Liu, Yongtian Wang, Weitao Song

Summary: Augmented reality technology presents virtual objects in the real world to assist depth perception. Previous research focused on the impact of virtual object properties, while this study investigates the influence of ambient luminance, virtual object luminance, and shading model on depth perception. Results show that a higher luminance contrast between the real scene and the virtual object leads to decreased accuracy in depth perception in AR.

LEUKOS (2023)

Article Engineering, Industrial

Augmented reality-assisted gesture-based teleoperated system for robot motion planning

Ahmed Eslam Salman, Magdy Raouf Roman

Summary: The study proposes a human-robot interaction framework to enable remote communication between operators and robots in a simple and intuitive way. The purpose is to reduce stress on operators, increase accuracy, and reduce task completion time. The proposed system is specifically designed for use in radioactive isotope production factories.

INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION (2023)

Article Robotics

Robust Classification of Grasped Objects in Intuitive Human-Robot Collaboration Using a Wearable Force-Myography Device

Nadav D. Kahanowich, Avishai Sintov

Summary: The research proposed a method using wearable force-myography device to classify objects grasped by humans, improving accuracy through training classifiers and increasing certainty by real-time iterative method. The study demonstrates high accuracy of the method and its ability to enhance the performance of trained classifiers.

IEEE ROBOTICS AND AUTOMATION LETTERS (2021)

Article Neurosciences

On the Necessity of Recurrent Processing during Object Recognition: It Depends on the Need for Scene Segmentation

Noor Seijdel, Jessica Loke, Ron van de Klundert, Matthew van der Meer, Eva Quispel, Simon van Gaal, Edward H. F. de Haan, H. Steven Scholte

Summary: The study found that recurrent computations are crucial for figure-ground segmentation of objects embedded in complex scenes. Behavioral results, EEG measurements, and deep convolutional neural network performance all support the notion that recurrent processing is essential for recognizing objects in complex backgrounds.

JOURNAL OF NEUROSCIENCE (2021)

Article Computer Science, Information Systems

Depth map prediction from a single image with generative adversarial nets

Shaoyong Zhang, Na Li, Chenchen Qiu, Zhibin Yu, Haiyong Zheng, Bing Zheng

MULTIMEDIA TOOLS AND APPLICATIONS (2020)

Article Computer Science, Software Engineering

Learning spectral normalized adversarial systems with stacked structure for high-quality 3D object generation

Haoxu Zhang, Chenchen Qiu, Chao Wang, Bin Wei, Zhibin Yu, Haiyong Zheng, Juan Li

Summary: This paper introduces a new method for generating 3D objects based on generative adversarial networks (GANs), utilizing multiple generators and discriminators to enhance learning complex distributions. The model employs spectral normalization technology to ensure stable training and generate high-quality and realistic 3D objects. Additionally, the system is capable of recovering incomplete 3D objects and outperforms baseline models in object quality.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE (2021)

Article Computer Science, Information Systems

KA-Ensemble: towards imbalanced image classification ensembling under-sampling and over-sampling

Hao Ding, Bin Wei, Zhaorui Gu, Zhibin Yu, Haiyong Zheng, Bing Zheng, Juan Li

MULTIMEDIA TOOLS AND APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

Discriminative Region Proposal Adversarial Network for High-Quality Image-to-Image Translation

Chao Wang, Wenjie Niu, Yufeng Jiang, Haiyong Zheng, Zhibin Yu, Zhaorui Gu, Bing Zheng

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Article Computer Science, Information Systems

Fine-grained facial image-to-image translation with an attention based pipeline generative adversarial framework

Yan Zhao, Ziqiang Zheng, Chao Wang, Zhaorui Gu, Min Fu, Zhibin Yu, Haiyong Zheng, Nan Wang, Bing Zheng

MULTIMEDIA TOOLS AND APPLICATIONS (2020)

Article Chemistry, Analytical

TumorGAN: A Multi-Modal Data Augmentation Framework for Brain Tumor Segmentation

Qingyun Li, Zhibin Yu, Yubo Wang, Haiyong Zheng

SENSORS (2020)

Article Computer Science, Artificial Intelligence

Fast and accurate online sequential learning of respiratory motion with random convolution nodes for radiotherapy applications

Yubo Wang, Zhibin Yu, Tatinati Sivanagaraja, Kalyana C. Veluvolu

APPLIED SOFT COMPUTING (2020)

Article Computer Science, Hardware & Architecture

Representation-guided generative adversarial network for unpaired photo-to-caricature translation

Ziqiang Zheng, Hongzhi Liu, Fan Yang, Xingyu Zheng, Zhibin Yu, Shaoda Zhang

Summary: This study introduces an innovative framework for photo-to-caricature translation, using a representation-guided scheme to mimic the caricature style, and introducing a feature-pyramid adversarial network to improve image synthesis quality. Experimental results demonstrate the excellent imitation capabilities of the proposed method across various caricature datasets.

COMPUTERS & ELECTRICAL ENGINEERING (2021)

Article Chemistry, Analytical

Unpaired Underwater Image Synthesis with a Disentangled Representation for Underwater Depth Map Prediction

Qi Zhao, Zhichao Xin, Zhibin Yu, Bing Zheng

Summary: Estimation of underwater depth maps is crucial in underwater vision research, presenting challenges such as lack of paired data and dynamic underwater environments. Researchers have developed a novel framework combining image translation and depth map estimation techniques, utilizing a coarse-to-fine network for precise depth map estimation. The method efficiently addresses the issues in underwater image synthesis and depth map estimation, providing diverse underwater images and accurate depth map estimation results.

SENSORS (2021)

Article Computer Science, Artificial Intelligence

Generative Adversarial Network with Multi-branch Discriminator for imbalanced cross-species image-to-image translation

Ziqiang Zheng, Zhibin Yu, Yang Wu, Haiyong Zheng, Bing Zheng, Minho Lee

Summary: This paper introduces a method to address the imbalanced learning problem through cross-species image-to-image translation, and proposes a novel, simple, and effective structure of Multi-Branch Discriminator (MBD) based on Generative Adversarial Networks (GANs). The effectiveness of the MBD is demonstrated through both theoretical analysis and empirical evaluation, showing remarkable performance in various cross-species image translation tasks.

NEURAL NETWORKS (2021)

Article Computer Science, Information Systems

One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework

Ziqiang Zheng, Zhibin Yu, Haiyong Zheng, Yang Yang, Heng Tao Shen

Summary: The paper proposes an effective multi-adversarial framework based on part-global learning for one-shot cross-domain image-to-image translation. Extensive experiments show that the proposed approach achieves impressive results on imbalanced image domains and outperforms existing methods in one-shot image-to-image translation.

IEEE TRANSACTIONS ON MULTIMEDIA (2022)

Article Computer Science, Information Systems

Underwater Image Enhancement Based on a Spiral Generative Adversarial Framework

Ruyue Han, Yang Guan, Zhibin Yu, Peng Liu, Haiyong Zheng

IEEE ACCESS (2020)

Article Computer Science, Information Systems

In Situ Holothurian Noncontact Counting System: A General Framework for Holothurian Counting

Xinliang Zhang, Huimin Zeng, Xiang Liu, Zhibin Yu, Haiyong Zheng, Bing Zheng

IEEE ACCESS (2020)

Article Computer Science, Information Systems

SR-ITM-GAN: Learning 4K UHD HDR With a Generative Adversarial Network

Huimin Zeng, Xinliang Zhang, Zhibin Yu, Yubo Wang

IEEE ACCESS (2020)

Article Computer Science, Information Systems

Learning Attention-Enhanced Spatiotemporal Representation for Action Recognition

Zhensheng Shi, Liangjie Cao, Cheng Guan, Haiyong Zheng, Zhaorui Gu, Zhibin Yu, Bing Zheng

IEEE ACCESS (2020)

Article Computer Science, Artificial Intelligence

Reduced-complexity Convolutional Neural Network in the compressed domain

Hamdan Abdellatef, Lina J. Karam

Summary: This paper proposes performing the learning and inference processes in the compressed domain to reduce computational complexity and improve speed of neural networks. Experimental results show that modified ResNet-50 in the compressed domain is 70% faster than traditional spatial-based ResNet-50 while maintaining similar accuracy. Additionally, a preprocessing step with partial encoding is suggested to improve resilience to distortions caused by low-quality encoded images. Training a network with highly compressed data can achieve good classification accuracy with significantly reduced storage requirements.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Theoretical limits on the speed of learning inverse models explain the rate of adaptation in arm reaching tasks

Victor R. Barradas, Yasuharu Koike, Nicolas Schweighofer

Summary: Inverse models are essential for human motor learning as they map desired actions to motor commands. The shape of the error surface and the distribution of targets in a task play a crucial role in determining the speed of learning.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Learning a robust foundation model against clean-label data poisoning attacks at downstream tasks

Ting Zhou, Hanshu Yan, Jingfeng Zhang, Lei Liu, Bo Han

Summary: We propose a defense strategy that reduces the success rate of data poisoning attacks in downstream tasks by pre-training a robust foundation model.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for neural networks

Hao Sun, Li Shen, Qihuang Zhong, Liang Ding, Shixiang Chen, Jingwei Sun, Jing Li, Guangzhong Sun, Dacheng Tao

Summary: In this paper, the convergence rate of AdaSAM in the stochastic non-convex setting is analyzed. Theoretical proof shows that AdaSAM has a linear speedup property and decouples the stochastic gradient steps with the adaptive learning rate and perturbed gradient. Experimental results demonstrate that AdaSAM outperforms other optimizers in terms of performance.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Grasping detection of dual manipulators based on Markov decision process with neural network

Juntong Yun, Du Jiang, Li Huang, Bo Tao, Shangchun Liao, Ying Liu, Xin Liu, Gongfa Li, Disi Chen, Baojia Chen

Summary: In this study, a dual manipulator grasping detection model based on the Markov decision process is proposed. By parameterizing the grasping detection model of dual manipulators using a cross entropy convolutional neural network and a full convolutional neural network, stable grasping of complex multiple objects is achieved. Robot grasping experiments were conducted to verify the feasibility and superiority of this method.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Asymmetric double networks mutual teaching for unsupervised person Re-identification

Miaohui Zhang, Kaifang Li, Jianxin Ma, Xile Wang

Summary: This paper proposes an unsupervised person re-identification (Re-ID) method that uses two asymmetric networks to generate pseudo-labels for each other by clustering and updates and optimizes the pseudo-labels through alternate training. It also designs similarity compensation and similarity suppression based on the camera ID of pedestrian images to optimize the similarity measure. Extensive experiments show that the proposed method achieves superior performance compared to state-of-the-art unsupervised person re-identification methods.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Low-variance Forward Gradients using Direct Feedback Alignment and momentum

Florian Bacho, Dominique Chu

Summary: This paper proposes a new approach called the Forward Direct Feedback Alignment algorithm for supervised learning in deep neural networks. By combining activity-perturbed forward gradients, direct feedback alignment, and momentum, this method achieves better performance and convergence speed compared to other local alternatives to backpropagation.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Maximum margin and global criterion based-recursive feature selection

Xiaojian Ding, Yi Li, Shilin Chen

Summary: This research paper addresses the limitations of recursive feature elimination (RFE) and its variants in high-dimensional feature selection tasks. The proposed algorithms, which introduce a novel feature ranking criterion and an optimal feature subset evaluation algorithm, outperform current state-of-the-art methods.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Mental image reconstruction from human brain activity: Neural decoding of mental imagery via deep neural network-based Bayesian estimation

Naoko Koide-Majima, Shinji Nishimoto, Kei Majima

Summary: Visual images observed by humans can be reconstructed from brain activity, and the visualization of arbitrary natural images from mental imagery has been achieved through an improved method. This study provides a unique tool for directly investigating the subjective contents of the brain.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Hierarchical attention network with progressive feature fusion for facial expression recognition

Huanjie Tao, Qianyue Duan

Summary: In this paper, a hierarchical attention network with progressive feature fusion is proposed for facial expression recognition (FER), addressing the challenges posed by pose variation, occlusions, and illumination variation. The model achieves enhanced performance by aggregating diverse features and progressively enhancing discriminative features.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

SLAPP: Subgraph-level attention-based performance prediction for deep learning models

Zhenyi Wang, Pengfei Yang, Linwei Hu, Bowen Zhang, Chengmin Lin, Wenkai Lv, Quan Wang

Summary: In the face of the complex landscape of deep learning, we propose a novel subgraph-level performance prediction method called SLAPP, which combines graph and operator features through an innovative graph neural network called EAGAT, providing accurate performance predictions. In addition, we introduce a mixed loss design with dynamic weight adjustment to improve predictive accuracy.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

LDCNet: Lightweight dynamic convolution network for laparoscopic procedures image segmentation

Yiyang Yin, Shuangling Luo, Jun Zhou, Liang Kang, Calvin Yu-Chian Chen

Summary: Medical image segmentation is crucial for modern healthcare systems, especially in reducing surgical risks and planning treatments. Transanal total mesorectal excision (TaTME) has become an important method for treating colon and rectum cancers. Real-time instance segmentation during TaTME surgeries can assist surgeons in minimizing risks. However, the dynamic variations in TaTME images pose challenges for accurate instance segmentation.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

start-stop points CenterNet for wideband signals detection and time-frequency localization in spectrum sensing

Teng Cheng, Lei Sun, Junning Zhang, Jinling Wang, Zhanyang Wei

Summary: This study proposes a scheme that combines the start-stop point signal features for wideband multi-signal detection, called Fast Spectrum-Size Self-Training network (FSSNet). By utilizing start-stop points to build the signal model, this method successfully solves the difficulty of existing deep learning methods in detecting discontinuous signals and achieves satisfactory detection speed.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Learning deep representation and discriminative features for clustering of multi-layer networks

Wenming Wu, Xiaoke Ma, Quan Wang, Maoguo Gong, Quanxue Gao

Summary: The layer-specific modules in multi-layer networks are critical for understanding the structure and function of the system. However, existing methods fail to accurately characterize and balance the connectivity and specificity of these modules. To address this issue, a joint learning graph clustering algorithm (DRDF) is proposed, which learns the deep representation and discriminative features of the multi-layer network, and balances the connectivity and specificity of the layer-specific modules through joint learning.

NEURAL NETWORKS (2024)

Article Computer Science, Artificial Intelligence

Boundary uncertainty aware network for automated polyp segmentation

Guanghui Yue, Guibin Zhuo, Weiqing Yan, Tianwei Zhou, Chang Tang, Peng Yang, Tianfu Wang

Summary: This paper proposes a novel boundary uncertainty aware network (BUNet) for precise and robust colorectal polyp segmentation. BUNet utilizes a pyramid vision transformer encoder to learn multi-scale features and incorporates a boundary exploration module (BEM) and a boundary uncertainty aware module (BUM) to handle boundary areas. Experimental results demonstrate that BUNet outperforms other methods in terms of performance and generalization ability.

NEURAL NETWORKS (2024)
