4.6 Article

Local-to-global mesh saliency

期刊

VISUAL COMPUTER
卷 34, 期 3, 页码 323-336

出版社

SPRINGER
DOI: 10.1007/s00371-016-1334-9

关键词

Mesh saliency; Laplacian; Global distinctness

资金

  1. EPSRC [EP/L006685/1]
  2. Engineering and Physical Sciences Research Council [EP/L006685/1] Funding Source: researchfish
  3. EPSRC [EP/L006685/1] Funding Source: UKRI

向作者/读者索取更多资源

As a measure of regional importance in agreement with human perception of 3D shape, mesh saliency should be based on local geometric information within a mesh but more than that. Recent research has shown that global consideration has a significant role in mesh saliency. This paper proposes a local-to-global framework for computing mesh saliency where we offer novel solutions to solve three inherent problems: (1) an algorithm based on statistic Laplacian which does not only compute local saliency, but also facilitates the later computation of global saliency; (2) a local-to-global method based on pooling and global distinctness to compute global saliency; (3) a framework to integrate local and global saliency. Experiments demonstrate that our approach can effectively detect salient features consistent with human perceptual interest. We also provide comparisons to existing state-of-the-art methods for mesh saliency and show improved results produced by our method.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Software Engineering

Mesh Saliency via Weakly Supervised Classification-for-Saliency CNN

Ran Song, Yonghuai Liu, Paul L. Rosin

Summary: A novel network trained in a weakly supervised manner, called CfS-CNN, has been developed to solve the difficulty of collecting vertex-level annotation for mesh saliency detection. This network outperforms existing state-of-the-art methods in mesh saliency detection and can be directly applied to scene saliency. Experimental results demonstrate the significant improvement of this approach.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2021)

Article Computer Science, Software Engineering

Visual complexity of shapes: a hierarchical perceptual learning model

Lingchen Dai, Kang Zhang, Xianjun Sam Zheng, Ralph R. Martin, Yina Li, Jinhui Yu

Summary: Understanding how people perceive the visual complexity of shapes has theoretical and practical implications, with one school focusing on local features and another emphasizing global characteristics. Inspired by neuroscience, a model incorporating both local and global features is 92% in agreement with human ratings. It is also consistent with the hierarchical perceptual learning theory on how different layers of neurons contribute to the perception of visual shape complexity.

VISUAL COMPUTER (2022)

Article Computer Science, Information Systems

JoT-GAN: A Framework for Jointly Training GAN and Person Re-Identification Model

Zhongwei Zhao, Ran Song, Qian Zhang, Peng Duan, Youmei Zhang

Summary: In this paper, a joint training framework called JoT-GAN is proposed to simultaneously train GAN and person re-identification (re-id) model. The optimal solutions for both generator and re-id model are achieved through mutual guidance by a discriminator. Experimental results on benchmark datasets show that the proposed joint training framework outperforms existing methods with separate training and achieves state-of-the-art re-id performance.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

3D Layout Estimation via Weakly Supervised Learning of Plane Parameters From 2D Segmentation

Weidong Zhang, Youmei Zhang, Ran Song, Ying Liu, Wei Zhang

Summary: In this paper, a novel weakly supervised learning framework is proposed to effectively learn the 3D layout of an indoor scene using 2D layout segmentation mask as supervision. A deep neural network is employed to predict the plane parameters and camera intrinsic parameters, which are used to generate the 3D layout, depth map, and 2D segmentation. Label consistency and depth consistency are utilized as key objectives to supervise the learning process, while 3D geometric reasoning and prior knowledge are incorporated to ensure the realism and reasonableness of the learned 3D layout.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Robotics

Bio-Inspired Rhythmic Locomotion for Quadruped Robots

Jiapeng Sheng, Yanyun Chen, Xing Fang, Wei Zhang, Ran Song, Yu Zheng, Yibin Li

Summary: This paper explores the locomotion mechanisms of mammals and applies them to the design of control architectures for robots. By incorporating a rhythm generator to activate periodic motor patterns and using joint position increments as the outputs of RL policy, the proposed method can achieve a full spectrum of quadruped locomotion in both simulated and real-world scenarios.

IEEE ROBOTICS AND AUTOMATION LETTERS (2022)

Article Computer Science, Artificial Intelligence

3D vessel-like structure segmentation in medical images by an edge-reinforced network

Likun Xia, Hao Zhang, Yufei Wu, Ran Song, Yuhui Ma, Lei Mou, Jiang Liu, Yixuan Xie, Ming Ma, Yitian Zhao

Summary: The vessel-like structure in biomedical images is essential for understanding diseases and diagnosing and treating them. However, existing segmentation methods often produce unsatisfactory results due to challenges in identifying crisp edges.

MEDICAL IMAGE ANALYSIS (2022)

Article Automation & Control Systems

Heuristics Integrated Deep Reinforcement Learning for Online 3D Bin Packing

Shuo Yang, Shuai Song, Shilei Chu, Ran Song, Jiyu Cheng, Yibin Li, Wei Zhang

Summary: This study investigates how to effectively integrate heuristics into the online 3D bin-packing problem by combining heuristic methods with deep reinforcement learning (DRL) techniques. Three different heuristics are designed based on real-world physical rules and experiences of human packers. They are then modeled into the DRL framework, resulting in a novel heuristic DRL method. Experimental results demonstrate that this method achieves state-of-the-art bin packing performance and can reliably accomplish bin packing tasks in real logistics scenarios.

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

SMAM: Self and Mutual Adaptive Matching for Skeleton-Based Few-Shot Action Recognition

Zhiheng Li, Xuyuan Gong, Ran Song, Peng Duan, Jun Liu, Wei Zhang

Summary: This paper proposes a self and mutual adaptive matching (SMAM) module to enhance the discriminability of feature maps extracted from sparse skeleton representation for few-shot action recognition. Experimental results demonstrate that the proposed SMAM-Net outperforms other baselines on the large-scale NTU RGB + D 120 dataset and generalizes well on smaller datasets. Codes and pretrained models will be publicly available.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Interdisciplinary Applications

Classification of Brain Disorders in rs-fMRI via Local-to-Global Graph Neural Networks

Hao Zhang, Ran Song, Liping Wang, Lin Zhang, Dawei Wang, Cong Wang, Wei Zhang

Summary: This paper proposes a local-to-global graph neural network (LG-GNN) to address the issue of accurate classification in brain disorders by considering non-imaging information and the relationship between subjects. The proposed LG-GNN achieves state-of-the-art performance in ASD and AD classification.

IEEE TRANSACTIONS ON MEDICAL IMAGING (2023)

Article Automation & Control Systems

Explicit-to-Implicit Robot Imitation Learning by Exploring Visual Content Change

Shuo Yang, Wei Zhang, Ran Song, Weizhi Lu, Hesheng Wang, Yibin Li

Summary: Demonstration understanding is crucial for robot imitation learning. This work investigates the visual change-based representation and builds imitation learning pipelines in both explicit and implicit ways. Experimental results show that the visual change-based approaches achieve state-of-the-art performance and indicate the superiority of the implicit method for imitation learning.

IEEE-ASME TRANSACTIONS ON MECHATRONICS (2022)

Article Engineering, Civil

Unsupervised Maritime Vessel Re-Identification With Multi-Level Contrastive Learning

Qian Zhang, Mingxin Zhang, Jinghe Liu, Xuanyu He, Ran Song, Wei Zhang

Summary: This paper proposes a simple but effective method to solve the hard-positive problem in vessel re-identification. It constructs a multi-level contrastive learning framework trained with a specifically designed intra-batch cluster-level contrastive loss and an instance-level contrastive loss. Experimental results on a newly proposed dataset show that this method achieves state-of-the-art performance compared to other unsupervised methods.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Automation & Control Systems

Watch and Act: Learning Robotic Manipulation From Visual Demonstration

Shuo Yang, Wei Zhang, Ran Song, Jiyu Cheng, Hesheng Wang, Yibin Li

Summary: This article proposes a watch-and-act imitation learning pipeline that enables robots to learn diverse manipulations from visual demonstrations. The system consists of a captioning module for understanding the demonstration videos and a manipulation module for learning the demonstrated manipulations. Extensive experiments validate the superiority of the two modules and a real robotic arm is used for demonstrating the whole robotic imitation system.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Article Computer Science, Information Systems

Unsupervised Embedding Learning With Mutual-Information Graph Convolutional Networks

Lin Zhang, Mingxin Zhang, Ran Song, Ziying Zhao, Xiaolei Li

Summary: This paper proposes an unsupervised method based on graph convolutional network for embedding learning, which encodes the structural information between samples and leverages augmentation to learn discriminative embeddings.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

3D Visual Saliency: An Independent Perceptual Measure or a Derivative of 2D Image Saliency?

Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu, Paul L. Rosin

Summary: This paper proposes a framework that combines a Generative Adversarial Network and a Conditional Random Field to learn the visual saliency of 3D objects and scenes. The experimental results demonstrate that this method outperforms the existing approaches in predicting human fixations and addresses the research question about 3D visual saliency.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Automation & Control Systems

A Hierarchical Framework for Quadruped Omnidirectional Locomotion Based on Reinforcement Learning

Wenhao Tan, Xing Fang, Wei Zhang, Ran Song, Teng Chen, Yu Zheng, Yibin Li

Summary: This paper proposes a learning system for quadruped robots that can adapt to various terrains without pre-training. By introducing a hierarchical framework and reinforcement learning, the robot achieves agile locomotion in the real world.

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING (2023)

暂无数据