4.8 Article

Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System

期刊

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
卷 18, 期 8, 页码 5171-5179

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2021.3122801

关键词

Task analysis; Head; Semantics; Anomaly detection; Video surveillance; Feature extraction; Training; Anomaly detection; contrastive learning; deep learning; intelligent video surveillance; unsupervised learning

资金

  1. Establishment of Key Laboratory of Shenzhen Science and Technology Innovation Committee [ZDSYS20190902093015527]
  2. Shenzhen Fundamental Research Fund [JCYJ20190806142416685, TII-21-2623]

向作者/读者索取更多资源

In this article, a novel method called TAC-Net is proposed to address the problem of anomaly detection in intelligent video surveillance. It calculates anomaly scores by utilizing contrastive similarity and employs deep contrastive self-supervised learning in multiple self-supervised tasks to capture high-level semantic features.
The continuous developments of urban and industrial environments have increased the demand for intelligent video surveillance. Deep learning has achieved remarkable performance for anomaly detection in surveillance videos. Previous approaches achieve anomaly detection with a single-pretext task (image reconstruction or prediction) and detect anomalies by larger reconstruction error or poor prediction. However, they cannot fully exploit the discriminative semantics and temporal context information. Moreover, tackling anomaly detection with a single pretext task is suboptimal due to the nonalignment between the pretext task and anomaly detection. In this article, we propose a temporal-aware contrastive network (TAC-Net) to address the abovementioned problems of anomaly detection for intelligence video surveillance. TAC-Net is an unsupervised method that utilizes deep contrastive self-supervised learning to capture the high-level semantic features and tackles anomaly detection with multiple self-supervised tasks. During inference phase, the multiple task losses and contrastive similarity are utilized to calculate the anomaly score. Experimental results show that our method is superior to state-of-the-art approaches on three benchmarks, which demonstrates the validity and advancement of TAC-Net.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Image Quality Assessment-driven Reinforcement Learning for Mixed Distorted Image Restoration

Xiaoyu Zhang, Wei Gao, Ge Li, Qiuping Jiang, Runmin Cong

Summary: Due to the diverse nature of degradation process, recovery of mixed distorted images remains challenging. Training deep learning models for one degradation type leads to significant decline in performance for other degradation types. In this article, we propose a hierarchical toolkit to address the limitations of a single deep network and explore the use of reinforcement learning for efficient restoration of distorted images. Our method accurately captures distortion preferences and achieves quality improvements through exploration using various evaluation methods. Experimental results show significant improvement in peak signal-to-noise ratio compared to state-of-the-art RL-based methods.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2023)

Article Engineering, Electrical & Electronic

Incomplete Multiview Clustering via Cross-View Relation Transfer

Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen, Yao Zhao

Summary: In this paper, the problem of multi-view clustering on incomplete views is studied. A novel incomplete multi-view clustering framework is proposed, which incorporates cross-view relation transfer and multi-view fusion learning. Extensive experiments on real datasets demonstrate the effectiveness of the proposed method.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

Cross-Modality Double Bidirectional Interaction and Fusion Network for RGB-T Salient Object Detection

Zhengxuan Xie, Feng Shao, Gang Chen, Hangwei Chen, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho

Summary: RGB-T salient object detection aims to detect and segment saliency regions on RGB images and thermal maps. The proposed CMDBIF-Net effectively reduces the modality difference and achieves outstanding performance.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

Selecting High-Quality Proposals for Weakly Supervised Object Detection With Bottom-Up Aggregated Attention and Phase-Aware Loss

Zhihao Wu, Chengliang Liu, Jie Wen, Yong Xu, Jian Yang, Xuelong Li

Summary: Weakly supervised object detection (WSOD) has gained attention due to its use of image-category annotations for training. We propose selecting high-quality proposals and addressing the issues of part domination and untight boxes to improve WSOD performance. Our approach significantly improves the performance of WSOD according to experiments on PASCAL VOC and MS COCO datasets.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Information Systems

Multiple Instance Detection Networks With Adaptive Instance Refinement

Zhihao Wu, Jie Wen, Yong Xu, Jian Yang, David Zhang

Summary: Weakly supervised object detection (WSOD) aims to train object detectors using only image-level annotations. Many recent works use multiple instance detection networks (MIDN), but these methods tend to detect salient objects and parts due to lack of instance-level annotations. In this paper, they propose an adaptive instance refinement (AIR) framework to address this issue.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Engineering, Electrical & Electronic

A Perception-Aware Decomposition and Fusion Framework for Underwater Image Enhancement

Yaozu Kang, Qiuping Jiang, Chongyi Li, Wenqi Ren, Hantao Liu, Pengjun Wang

Summary: This paper proposes a perception-aware decomposition and fusion framework for underwater image enhancement (UIE). Two complementary pre-processed inputs are fused in a perception-aware and conceptually independent image space through a structural patch decomposition and fusion (SPDF) approach. The main advantage of SPDF is that it can fuse different components separately without any interactions and information loss. Comprehensive comparisons demonstrate that SPDF outperforms state-of-the-art UIE algorithms.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

Modality-Induced Transfer-Fusion Network for RGB-D and RGB-T Salient Object Detection

Gang Chen, Feng Shao, Xiongli Chai, Hangwei Chen, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho

Summary: This paper proposes a novel Modality-Induced Transfer-Fusion Network (MITF-Net) for RGB-D and RGB-T salient object detection. The network is designed to capture the complementary information of multi-modality data and improve performance through modality transfer fusion and cycle-separated attention modules. The proposed MITF-Net achieves competitive and excellent performance on RGB-D and RGB-T salient object detection datasets.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Biochemical Research Methods

PreTP-Stack: Prediction of Therapeutic Peptides Based on the Stacked Ensemble Learing

Ke Yan, Hongwu Lv, Jie Wen, Yichen Guo, Yong Xu, Bin Liu

Summary: In this study, a novel stacking framework PreTP-Stack is proposed for predicting different types of therapeutic peptides. The proposed method achieved better or highly comparable performance with the state-of-the-art methods for predicting eight types of therapeutic peptides. A user-friendly web-server predictor is also available for use.

IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (2023)

Review Mathematics

Multiview-Learning-Based Generic Palmprint Recognition: A Literature Review

Shuping Zhao, Lunke Fei, Jie Wen

Summary: This paper summarizes six single-view palmprint representation methods published from 2004 to 2022 and discusses multiview-based palmprint recognition methods. There is currently no work summarizing the multiview fusion for different types of palmprint features.

MATHEMATICS (2023)

Article Computer Science, Artificial Intelligence

Adaptive Context Exploration Network for Polyp Segmentation in Colonoscopy Images

Guanghui Yue, Siying Li, Tianwei Zhou, Miaohui Wang, Jingfeng Du, Qiuping Jiang, Wei Gao, Tianfu Wang, Jun Lv

Summary: Automatic and accurate polyp segmentation is a challenging issue. We construct a benchmark dataset and propose a novel adaptive context exploration network (ACENet) which achieves superior performance on multiple evaluation metrics over state-of-the-art methods.

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE (2023)

Article Engineering, Electrical & Electronic

Bidirectional Collaborative Mentoring Network for Marine Organism Detection and Beyond

Jinguang Cheng, Zongwei Wu, Shuo Wang, Cedric Demonceaux, Qiuping Jiang

Summary: This article introduces a novel organism detection method called BCMNet, which fully utilizes texture and context clues during the encoding and decoding stages, improving the accuracy of locating the target organism object in marine scenes.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

Quality Evaluation of Arbitrary Style Transfer: Subjective Study and Objective Metric

Hangwei Chen, Feng Shao, Xiongli Chai, Yuese Gu, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho

Summary: This paper discusses the important topic of arbitrary neural style transfer, which has significant research value and wide industrial application. The focus is on improving the quality of arbitrary style transfer (AST) and the lack of exploration in quality evaluation of AST images. The paper proposes a new AST images quality assessment database (AST-IQAD) and a sparse representation-based method for measuring the image quality.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Artificial Intelligence

Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection

Chao Huang, Jie Wen, Yong Xu, Qiuping Jiang, Jian Yang, Yaowei Wang, David Zhang

Summary: In this article, a novel self-supervised framework for unsupervised video anomaly detection is proposed. The framework combines a self-attentive predictor, a vanilla discriminator, and a self-supervised discriminator to capture the regular patterns in videos and effectively detect abnormal frames.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Automation & Control Systems

Latent Low-Rank Representation With Weighted Distance Penalty for Clustering

Zhiqiang Fu, Yao Zhao, Dongxia Chang, Yiming Wang, Jie Wen

Summary: In this article, we propose a latent low-rank representation method with weighted distance penalty (LLRRWD) for clustering. By considering the local geometric information and using a weighted distance penalty, our method improves the discrimination of the learned affinity matrix, while reducing the effect of noise and redundancy.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

暂无数据