Article

Harmony Potentials

INTERNATIONAL JOURNAL OF COMPUTER VISION (2012)

Journal

INTERNATIONAL JOURNAL OF COMPUTER VISION

Volume 96, Issue 1, Pages 83-102

Publisher

SPRINGER

DOI: 10.1007/s11263-011-0449-8

Keywords

Semantic object segmentation; Hierarchical conditional random fields

Category

Computer Science, Artificial Intelligence

Funding

EU [ERGTS-VICI-224737, VIDI-VIDEO IST-045547, FP7-ICT-24314, FP7-ICT-248873]
Spanish Research Program Consolider-Ingenio: MIPRCV [CSD2007-00018]
Spanish projects [TIN2009-14501-C02-02, TIN2009-14173, TRA2010-21371-C03-01]
Ramon y Cajal fellowship
FPU [AP2008-03378]

Abstract

The Hierarchical Conditional Random Field (HCRF) model has been successfully applied to a number of image labeling problems, including image segmentation. However, existing HCRF models of image segmentation do not allow multiple classes to be assigned to a single region, which limits their ability to incorporate contextual information across multiple scales. At higher scales in the image, this representation yields an oversimplified model, since multiple classes can reasonably be expected to appear within large regions. This simplified model particularly limits the impact of information at higher scales. Since class-label information at these scales is usually more reliable than at lower, noisier scales, neglecting this information is undesirable. To address these issues, we propose a new consistency potential for image labeling problems, which we call the harmony potential. It can encode any possible combination of labels, penalizing only unlikely combinations of classes. We also propose an effective sampling strategy over this expanded label set that makes the underlying optimization problem tractable. Our approach obtains state-of-the-art results on two challenging, standard benchmark datasets for semantic image segmentation: PASCAL VOC 2010 and MSRC-21.
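
The abstract does not give the form of the potential, so the following is only an illustrative sketch of the idea it describes: a region that may hold a set of labels penalizes a local label only when that label is missing from the set, whereas a single-label consistency term penalizes every disagreement. All function names, the unit cost, and the toy labels are assumptions made for this example, not the authors' formulation.

```python
from itertools import combinations


def single_label_penalty(local_label, region_label, cost=1.0):
    """Single-label consistency: any disagreement with the region label is penalized."""
    return 0.0 if local_label == region_label else cost


def set_valued_penalty(local_label, region_labels, cost=1.0):
    """Set-valued consistency (hypothetical): penalize only labels absent from the region's label set."""
    return 0.0 if local_label in region_labels else cost


def all_label_subsets(labels):
    """Enumerate every non-empty subset of the label set (2^n - 1 subsets)."""
    ordered = sorted(labels)
    return [
        frozenset(combo)
        for size in range(1, len(ordered) + 1)
        for combo in combinations(ordered, size)
    ]


# Toy region containing two classes (assumed data).
local_labels = ["cow", "cow", "grass", "grass", "grass"]
classes = {"cow", "grass", "sky"}

# Best achievable cost when the region may carry only one label.
single_cost = min(
    sum(single_label_penalty(label, region) for label in local_labels)
    for region in classes
)

# Cost when the region is allowed to carry the set {cow, grass}.
set_cost = sum(
    set_valued_penalty(label, frozenset({"cow", "grass"})) for label in local_labels
)

# The expanded label space grows exponentially with the number of classes.
num_subsets = len(all_label_subsets(classes))
```

In this toy case the single-label region always pays for the minority class, while the set-valued region pays nothing. The number of candidate label sets doubles with each added class, which is one way to see why the abstract pairs the potential with a sampling strategy over the expanded label set.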

Recommended

Article Computer Science, Artificial Intelligence

HCFS3D: Hierarchical coupled feature selection network for 3D semantic and instance segmentation

Jingang Tan, Kangru Wang, Lili Chen, Guanghui Zhang, Jiamao Li, Xiaolin Zhang

Summary: This paper proposes a novel and robust 3D point cloud segmentation framework HCFS3D, which can perform semantic and instance segmentation simultaneously. By using methods like Adaptive Smooth Loss and conditional random fields, the framework shows superior performance in experiments.

IMAGE AND VISION COMPUTING (2021)

Article Engineering, Electrical & Electronic

Superpixel-based object boundary gimmicking using optimized conditional random fields with random associations

Manonmani Arunkumar, Vijayakumari Pushparaj

Summary: The proposed two-level object segmentation framework utilizes a superpixel-based object boundary gimmicking algorithm and optimized conditional random field algorithm to achieve competitive results in terms of image visualization and performance evaluation.

SIGNAL IMAGE AND VIDEO PROCESSING (2021)

Article Environmental Sciences

Road Extraction from Remote Sensing Images Using the Inner Convolution Integrated Encoder-Decoder Network and Directional Conditional Random Fields

Shuyang Wang, Xiaodong Mu, Dongfang Yang, Hao He, Peng Zhao

Summary: In this study, an inner convolution integrated encoder-decoder network with directional conditional random fields post-processing was proposed to extract roads from remote sensing images. The approach achieved high-quality road segmentation and connectivity, addressing the problem caused by occlusions.

REMOTE SENSING (2021)

Article Computer Science, Artificial Intelligence

Refined UNet v3: Efficient end-to-end patch-wise network for cloud and shadow segmentation with multi-channel spectral features

Libin Jiao, Lianzhi Huo, Changmiao Hu, Ping Tang

Summary: Refined UNet v3 upgrades the bilateral message-passing kernel and the efficient implementation of Gaussian filtering in the CRF layer, effectively capturing ambiguous edges and accelerating the message-passing procedure. Experimental results demonstrate that the proposed update outperforms its counterpart in terms of detecting vague edges, shadow retrieval, and isolated redundant regions, and it is practically efficient in our TensorFlow implementation.

NEURAL NETWORKS (2021)

Article Computer Science, Artificial Intelligence

ShuffleTrans: Patch-wise weight shuffle for transparent object segmentation

Boxiang Zhang, Zunran Wang, Yonggen Ling, Yuanyuan Guan, Shenghao Zhang, Wenhui Li, Lei Wei, Chunxu Zhang

Summary: Transparent object segmentation is a challenging task due to the lack of texture. Shape information plays a critical role in this task. To address this issue, the researchers propose an operation called Patch-wise Weight Shuffle and design a network called ShuffleTrans that performs better in shape recognition. Experimental results on multiple datasets demonstrate the effectiveness of the method in transparent object segmentation.

NEURAL NETWORKS (2023)

Article Ecology

Skyline variations allow estimating distance to trees on landscape photos using semantic segmentation

Laura Martinez-Sanchez, Daniele Borio, Raphael d'Andrimont, Marijn van der Velde

Summary: This article investigates the method of estimating tree distances using variations in the skyline of landscape photos. By extracting skyline height and applying various metrics, the study reveals distance-related information.

ECOLOGICAL INFORMATICS (2022)

Article Geography, Physical

Incorporating DeepLabv3+ and object-based image analysis for semantic segmentation of very high resolution remote sensing images

Shouji Du, Shihong Du, Bo Liu, Xiuyuan Zhang

Summary: This study proposes a semantic segmentation method for VHR images by combining a deep learning semantic segmentation model and object-based image analysis, which aims to capture precise outlines of ground objects and explore context information, achieving competitive overall accuracies for Vaihingen and Potsdam datasets.

INTERNATIONAL JOURNAL OF DIGITAL EARTH (2021)

Article Computer Science, Artificial Intelligence

Focus on hierarchical features: Soft-weighted hierarchical features network

Hongkai Lin, Wentian Xin, Shun Chang, Qianxue Yang, Qiguang Miao, Ruyi Liu, Liang Chang

Summary: This paper proposes a novel network structure, SWHF-Net, to address the issues in semantic segmentation, including underutilization of backbone-derived features and mismatch between small objects and large-scale encodings. SWHF-Net consists of ST-FPM and HF2M modules, which utilize feature transformation and hierarchical fusion to improve the semantic representation of multi-scale objects and enhance computational efficiency.

NEUROCOMPUTING (2023)

Article Computer Science, Information Systems

Automatic melanoma detection and segmentation in dermoscopy images using deep RetinaNet and conditional random fields

Hafeez Ur Rehman, Nudrat Nida, Syed Adnan Shah, Wakeel Ahmad, Muhammad Imran Faizi, Syed Muhammad Anwar

Summary: Melanoma, a major cause of death worldwide, is challenging to diagnose and treat. This study proposes a deep learning method that combines image processing, localization, and segmentation to accurately detect and segment melanoma regions.

MULTIMEDIA TOOLS AND APPLICATIONS (2022)

Article Engineering, Electrical & Electronic

Encoder-Decoder With Cascaded CRFs for Semantic Segmentation

Jian Ji, Rui Shi, Sitong Li, Peng Chen, Qiguang Miao

Summary: This article proposes a new semantic segmentation method that enhances the model's ability to locate object boundaries by introducing cascaded CRFs into the decoder and fusing the output with the last decoder's output, resulting in more accurate semantic segmentation results.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Chemistry, Analytical

Fish Segmentation in Sonar Images by Mask R-CNN on Feature Maps of Conditional Random Fields

Chin-Chun Chang, Yen-Po Wang, Shyi-Chyi Cheng

Summary: This paper proposes a method combining Mask R-CNN and PreCNN for fish segmentation in sonar images, improving accuracy and applicability. The PreCNN network extracts feature maps that provide standardized inputs for Mask R-CNN, making the method better suited to different fish farming environments.

SENSORS (2021)

Article Computer Science, Artificial Intelligence

A Survey on Deep Learning Technique for Video Segmentation

Tianfei Zhou, Fatih Porikli, David J. Crandall, Luc Van Gool, Wenguan Wang

Summary: Video segmentation is crucial in various practical applications such as enhancing visual effects in movies, understanding scenes in autonomous driving, and creating virtual background in video conferencing. Deep learning-based approaches have shown promising performance in video segmentation. This survey comprehensively reviews two main research lines - generic object segmentation and video semantic segmentation - by introducing their task settings, background concepts, need, development history, and challenges. Representative literature and datasets are also discussed, and the reviewed methods are benchmarked on well-known datasets. Open issues and opportunities for further research are identified, and a public website is provided to track developments in this field: https://github.com/tfzhou/VS-Survey.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

HANA: Hierarchical Attention Network Assembling for Semantic Segmentation

Wei Liu, Ding Li, Hongqi Su

Summary: This study proposed a semantic segmentation framework named hierarchical attention network assembling, which utilizes auxiliary information of different levels corresponding to the hidden and explicit features in the cognitive system, and further processes hidden information to assist the semantic segmentation task.

COGNITIVE COMPUTATION (2021)

Article Environmental Sciences

Semantic Segmentation of Polarimetric SAR Image Based on Dual-Channel Multi-Size Fully Connected Convolutional Conditional Random Field

Yingying Kong, Qiupeng Li

Summary: This paper proposes a polarization SAR image semantic segmentation method based on a dual-channel multi-size fully connected convolutional conditional random field. By inputting the full-polarization SAR image and the corresponding optical image simultaneously, integrating multi-size inputs, and introducing the importance of features, the accuracy of image segmentation is improved.

REMOTE SENSING (2022)

Article Computer Science, Artificial Intelligence

FarSeg++: Foreground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery

Zhuo Zheng, Yanfei Zhong, Junjue Wang, Ailong Ma, Liangpei Zhang

Summary: In this paper, a foreground-aware relation network (FarSeg++) is proposed to address the issues of scale variation, large intra-class variance of background, and foreground-background imbalance in high spatial resolution remote sensing imagery. The network improves the discrimination of foreground features, achieves balanced optimization, and enhances objectness representation. Experimental results demonstrate that FarSeg++ outperforms state-of-the-art semantic segmentation methods and achieves a better trade-off between speed and accuracy.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Automation & Control Systems

Deep Pain: Exploiting Long Short-Term Memory Networks for Facial Expression Classification

Pau Rodriguez, Guillem Cucurull, Jordi Gonzalez, Josep M. Gonfaus, Kamal Nasrollahi, Thomas B. Moeslund, F. Xavier Roca

Summary: This paper proposes an automatic system for pain assessment, which outperforms the latest techniques by feeding the raw frames to deep learning models and considering the temporal relation and whole image. The research achieves competitive results in the UNBC-McMaster Shoulder Pain Expression Archive Database and the Cohn Kanade+ facial expression database.

IEEE TRANSACTIONS ON CYBERNETICS (2022)

Article Computer Science, Artificial Intelligence

End-to-end global to local convolutional neural network learning for hand pose recovery in depth data

Meysam Madadi, Sergio Escalera, Xavier Baro, Jordi Gonzalez

Summary: This study introduces a novel hierarchical tree-like structured CNN to address the 3D pose estimation of human hands, training branches to specialize in local poses and fusing features to learn higher order dependencies among joints. Furthermore, a non-rigid data augmentation approach is employed to increase training depth data. Experimental results show competitive performance on various datasets.

IET COMPUTER VISION (2022)

Article Computer Science, Artificial Intelligence

Image rain removal and illumination enhancement done in one go

Yecong Wan, Yuanshuo Cheng, Mingwen Shao, Jordi Gonzalez

Summary: In this paper, a novel spatially-adaptive network SANet is proposed for simultaneous rain removal and illumination enhancement. A contrastive loss and a new synthetic dataset DarkRain are introduced to boost the development of rain image restoration algorithms.

KNOWLEDGE-BASED SYSTEMS (2022)

Article Computer Science, Information Systems

Main product detection with graph networks for fashion

Vacit Oguz Yazici, Longlong Yu, Arnau Ramisa, Luis Herranz, Joost van de Weijer

Summary: This paper proposes a model that uses Graph Convolutional Networks (GCN) to detect fashion products in bounding boxes for online fashion retail. Compared to the state-of-the-art approach, this method performs better in scenarios where title-input is missing and during cross-dataset evaluation.

MULTIMEDIA TOOLS AND APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

Single image super-resolution based on directional variance attention network

Parichehr Behjati, Pau Rodriguez, Carles Fernandez, Isabelle Hupont, Armin Mehri, Jordi Gonzalez

Summary: This study proposes a computationally efficient and accurate single image super-resolution network called DiVANet. By introducing a directional variance attention mechanism and a residual attention feature group, the network is able to improve the performance and efficiency of image recovery.

PATTERN RECOGNITION (2023)

Article Computer Science, Artificial Intelligence

Self-Training for Class-Incremental Semantic Segmentation

Lu Yu, Xialei Liu, Joost van de Weijer

Summary: This paper addresses the problem of catastrophic forgetting in deep neural networks during incremental learning in class-incremental semantic segmentation. A self-training approach is proposed, leveraging unlabeled data for rehearsal of previous knowledge. Experimental results show that maximizing self-entropy and using diverse auxiliary data can significantly improve performance. State-of-the-art results are achieved on Pascal-VOC 2012 and ADE20K datasets.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Information Systems

A Spatio-Temporal Spotting Network with Sliding Windows for Micro-Expression Detection

Wenwen Fu, Zhihong An, Wendong Huang, Haoran Sun, Wenjuan Gong, Jordi Gonzalez

Summary: This study investigates the problem of micro-expression spotting as a frame-by-frame micro-expression classification problem and proposes an effective spotting model. The experimental results demonstrate that the proposed method outperforms the state-of-the-art method in terms of overall F-scores on the CAS(ME)² and SAMM Long Videos databases.

ELECTRONICS (2023)

Article Computer Science, Artificial Intelligence

Class-Incremental Learning: Survey and Performance Evaluation on Image Classification

Marc Masana, Xialei Liu, Bartlomiej Twardowski, Mikel Menta, Andrew D. Bagdanov, Joost van de Weijer

Summary: For future learning systems, incremental learning is desirable due to its efficient resource usage, reduced memory usage, and resemblance to human learning. The main challenge for incremental learning is catastrophic forgetting. This paper provides a comprehensive survey of existing class-incremental learning methods for image classification and performs extensive experimental evaluations on thirteen methods.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Proceedings Paper Computer Science, Artificial Intelligence

3D-Aware Multi-Class Image-to-Image Translation with NeRFs

Senmao Li, Joost van de Weijer, Yaxing Wang, Fahad Shahbaz Khan, Meiqin Liu, Jian Yang

Summary: Recent advances in 3D-aware generative models combined with Neural Radiance Fields have achieved impressive results in 3D consistent multi-class image-to-image translation. To address the unrealistic shape/identity change in 2D-I2I translation, the learning process is divided into a multi-class 3D-aware GAN step and a 3D-aware I2I translation step, with novel techniques proposed to reduce view-consistency problems.

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

MVMO: A MULTI-OBJECT DATASET FOR WIDE BASELINE MULTI-VIEW SEMANTIC SEGMENTATION

Aitor Alvarez-Gila, Joost van de Weijer, Yaxing Wang, Estibaliz Garrote

Summary: MVMO is a synthetic dataset with high object density and wide camera baselines, enabling research in multi-view semantic segmentation and cross-view semantic transfer. New research is needed to utilize the information from multi-view setups effectively.

2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Visual Transformers with Primal Object Queries for Multi-Label Image Classification

Vacit Oguz Yazici, Joost Van De Weijer, Longlong Yu

Summary: This paper investigates the problem of multi-label image classification and proposes an enhanced transformer model that utilizes primal object queries to improve model performance and convergence speed.

2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2022)

Proceedings Paper Computer Science, Theory & Methods

Transferring Unconditional to Conditional GANs with Hyper-Modulation

Hector Laria, Yaxing Wang, Joost van de Weijer, Bogdan Raducanu

Summary: GANs have matured in recent years and can generate high-resolution, realistic images. This paper focuses on transferring from high-quality pretrained unconditional GANs to conditional GANs, proposing hyper-modulated generative networks for architectural adaptation and introducing self-initialization and contrastive loss for improved transfer efficiency.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022 (2022)

Proceedings Paper Computer Science, Theory & Methods

Area Under the ROC Curve Maximization for Metric Learning

Bojana Gajic, Ariel Amato, Ramon Baldrich, Joost van de Weijer, Carlo Gatta

Summary: Most popular metric learning losses are not directly related to the evaluation metrics used to assess their performance. However, training a metric learning model by maximizing the area under the ROC curve can induce a suitable implicit ranking for retrieval problems. By proposing an approximated and derivable AUC loss, state-of-the-art performance is achieved on large scale retrieval benchmark datasets.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022 (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Class-Balanced Active Learning for Image Classification

Javad Zolfaghari Bengar, Joost van de Weijer, Laura Lopez Fuentes, Bogdan Raducanu

Summary: In real-world scenarios, imbalanced class distribution in datasets further complicates the active learning process. To address this issue, we propose an optimization framework considering class-balancing, which can effectively improve the performance of active learning methods.

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) (2022)

Article Computer Science, Information Systems

Frequency-Based Enhancement Network for Efficient Super-Resolution

Parichehr Behjati, Pau Rodriguez, Carles Fernandez Tena, Armin Mehri, F. Xavier Roca, Seiichi Ozawa, Jordi Gonzalez

Summary: This study focuses on single image super-resolution based on deep convolutional neural networks (CNNs), proposing a novel Frequency-based Enhancement Block (FEB) to enhance high-frequency information and recover finer details. Experimental results show that replacing commonly used SR blocks with FEB improves reconstruction error and reduces the number of parameters in the model.

IEEE ACCESS (2022)
