Article

A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume 28, Issue 11, Pages -

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2019.2921877

Keywords

High efficiency video coding; in-loop filter; deep learning; multiple frames

Funding

  1. National Natural Science Foundation of China (NSFC) [61876013, 61573037]
  2. Fok Ying Tung Education Foundation [151061]

The in-loop filter has been studied extensively for the high efficiency video coding (HEVC) standard as a means of reducing compression artifacts and thus improving coding efficiency. However, existing approaches always apply the in-loop filter to each frame individually, without exploiting the content correlation among multiple frames. In this paper, we propose a multi-frame in-loop filter (MIF) for HEVC, which enhances the visual quality of each encoded frame by leveraging its adjacent frames. Specifically, we first construct a large-scale database containing encoded frames and their corresponding raw frames covering a variety of content, which can be used to learn the in-loop filter in HEVC. Furthermore, we find that for an encoded frame there usually exist a number of reference frames of higher quality and similar content. Accordingly, a reference frame selector (RFS) is designed to identify these frames. Then, a deep neural network for MIF (known as MIF-Net) is developed to enhance the quality of each encoded frame by utilizing the spatial information of this frame and the temporal information of its neighboring higher-quality frames. The MIF-Net is built on the recently developed DenseNet, benefiting from its improved generalization capacity and computational efficiency. In addition, a novel block-adaptive convolutional layer is designed and applied in the MIF-Net to handle the artifacts influenced by the coding tree unit (CTU) structure in HEVC. Extensive experiments show that our MIF approach achieves an average Bjontegaard delta bit-rate (BD-BR) saving of 11.621% on the standard test set, significantly outperforming the standard in-loop filter in HEVC and other state-of-the-art approaches.
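The abstract reports its main result as a BD-BR saving. As a rough illustration of what that metric measures, the following is a minimal sketch of the commonly used Bjontegaard procedure: fit a cubic to log bit-rate as a function of PSNR for each codec, average both curves over the shared PSNR range, and convert the difference into a percentage. The listing above does not say how the authors computed BD-BR, so the function names (`lagrange_eval`, `mean_log_rate`, `bd_rate`), the restriction to exactly four rate-distortion points per curve, and the sample numbers are all assumptions made for illustration.

```python
import math


def lagrange_eval(xs, ys, x):
    """Evaluate the polynomial that interpolates the points (xs, ys) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def mean_log_rate(psnr, rate, lo, hi):
    """Average log bit-rate of the cubic through four RD points over [lo, hi].

    With four points the cubic fit is the interpolating polynomial, and
    Simpson's rule integrates any cubic exactly.
    """
    log_rate = [math.log(r) for r in rate]
    f_lo = lagrange_eval(psnr, log_rate, lo)
    f_mid = lagrange_eval(psnr, log_rate, (lo + hi) / 2.0)
    f_hi = lagrange_eval(psnr, log_rate, hi)
    return (f_lo + 4.0 * f_mid + f_hi) / 6.0


def bd_rate(anchor_rate, anchor_psnr, test_rate, test_psnr):
    """Hypothetical helper: BD-BR in percent (negative means bit-rate saving)."""
    curves = (anchor_rate, anchor_psnr, test_rate, test_psnr)
    if any(len(c) != 4 for c in curves):
        raise ValueError("this sketch assumes exactly four RD points per curve")
    # Compare the two curves only where their PSNR ranges overlap.
    lo = max(min(anchor_psnr), min(test_psnr))
    hi = min(max(anchor_psnr), max(test_psnr))
    if hi <= lo:
        raise ValueError("PSNR ranges do not overlap")
    diff = (mean_log_rate(test_psnr, test_rate, lo, hi)
            - mean_log_rate(anchor_psnr, anchor_rate, lo, hi))
    return (math.exp(diff) - 1.0) * 100.0


# Made-up RD points: the test codec needs 10% fewer bits at every PSNR level.
anchor_rate = [1000.0, 2000.0, 4000.0, 8000.0]   # kbps, illustrative only
anchor_psnr = [32.0, 35.0, 38.0, 41.0]           # dB, illustrative only
test_rate = [0.9 * r for r in anchor_rate]
print(round(bd_rate(anchor_rate, anchor_psnr, test_rate, anchor_psnr), 3))  # -10.0
```

Under this convention, a figure such as the reported 11.621% saving would correspond to a BD-BR of about -11.6 relative to the anchor codec, averaged over the test sequences.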

Recommended

Article Engineering, Electrical & Electronic

Learning for Video Compression With Recurrent Auto-Encoder and Recurrent Probability Model

Ren Yang, Fabian Mentzer, Luc Van Gool, Radu Timofte

Summary: This paper introduces a Recurrent Learned Video Compression (RLVC) approach with Recurrent Auto-Encoder (RAE) and Recurrent Probability Model (RPM) to better utilize the temporal correlation among video frames, achieving state-of-the-art compression performance.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Hierarchical Bayesian LSTM for Head Trajectory Prediction on Omnidirectional Images

Li Yang, Mai Xu, Yichen Guo, Xin Deng, Fangyuan Gao, Zhenyu Guan

Summary: This paper proposes a novel approach for modeling human attention on omnidirectional images (ODIs) by integrating hierarchical Bayesian inference and long short-term memory (LSTM) network. The approach predicts head trajectories and saliency on ODIs by capturing temporal correlations and modeling inter-subject uncertainty. Extensive experiments demonstrate the superior performance of the proposed approach compared to state-of-the-art methods.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

DeepMIH: Deep Invertible Network for Multiple Image Hiding

Zhenyu Guan, Junpeng Jing, Xin Deng, Mai Xu, Lai Jiang, Zhou Zhang, Yipeng Li

Summary: This paper proposes a novel multiple image hiding framework DeepMIH based on invertible neural network. An invertible hiding neural network (IHNN) is developed to model the image concealing and revealing as its forward and backward processes innovatively, making them fully coupled and reversible. In addition, an importance map (IM) module is designed to guide the current image hiding and enhance the invisibility. Experimental results show that DeepMIH significantly outperforms other state-of-the-art methods in terms of hiding invisibility, security and recovery accuracy.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Engineering, Electrical & Electronic

A Lightweight and Machine-Learning-Resistant PUF Using Obfuscation-Feedback-Shift-Register

Zhuojun Chen, Wenshang Lee, Qinhui Hong, Chongyan Gu, Zhenyu Guan, Lin Ding, Jiliang Zhang

Summary: The paper presents a hardware security mechanism called OFSR PUF, which consists of weak PUF cells and an obfuscation mechanism. It effectively reduces storage overhead, overcomes collapse response, and provides higher security.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS (2022)

Article Computer Science, Information Systems

Extremely Lightweight PUF-based Batch Authentication Protocol for End-Edge-Cloud Hierarchical Smart Grid

Feifei Liu, Yu Yan, Yu Sun, Jianwei Liu, Dawei Li, Zhenyu Guan

Summary: This paper proposes a PUF-based batch authentication and key agreement protocol to protect both meters and gateways in the smart grid and provide end-to-end authentication. The computation overhead is reduced significantly by offloading heavy operations from field devices to the server. Additionally, devolving batch authentication and access control to the gateway decreases downlink communication and signaling cost and surpasses most recent schemes.

SECURITY AND COMMUNICATION NETWORKS (2022)

Article Engineering, Electrical & Electronic

Advancing Learned Video Compression With In-Loop Frame Prediction

Ren Yang, Radu Timofte, Luc Van Gool

Summary: In recent years, there has been an increasing interest in end-to-end learned video compression. Previous works focused on compressing motion maps to exploit temporal redundancy. However, they did not fully utilize historical information in sequential reference frames. This paper proposes an Advanced Learned Video Compression (ALVC) approach with an in-loop frame prediction module, which effectively predicts the target frame from previously compressed frames. The experiments demonstrate the state-of-the-art performance of ALVC in learned video compression.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Hardware & Architecture

Achieving Fine-Grained Data Sharing for Hierarchical Organizations in Clouds

Hua Deng, Zheng Qin, Qianhong Wu, Robert H. Deng, Zhenyu Guan, Yupeng Hu, Fangmin Li

Summary: Cloud computing is popular for data storage and sharing. Encryption is important for data security, but can hinder data sharing. This article proposes a hierarchical data sharing scheme that allows the data owner to selectively share encrypted data with users in a hierarchy, providing control over access.

IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING (2023)

Article Engineering, Electrical & Electronic

Lightweight and Privacy-Preserving Charging Reservation Authentication Protocol for 5G-V2G

Wanyu Hou, Yu Sun, Dawei Li, Zhenyu Guan, Jianwei Liu

Summary: This paper introduces a 5G-V2G system that integrates the 5G network and the power grid to enable the remote transmission of reservation information. A scalable reservation authentication and key agreement protocol is proposed to support flexible access and ensure the confidentiality and privacy of critical reservation data. The protocol utilizes lightweight algorithms and physical unclonable function (PUF) for security, outperforming existing schemes in terms of computing overhead, transmission overhead, signaling overhead, and security.

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

MASIC: Deep Mask Stereo Image Compression

Xin Deng, Yufan Deng, Ren Yang, Wenzhe Yang, Radu Timofte, Mai Xu

Summary: In this paper, a novel network model called MASIC is proposed for stereo image compression. It achieves higher compression efficiency and quality through the introduction of a mask prediction module and mask conditional stereo entropy model.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Automation & Control Systems

Consensus Control Based on Privacy-Preserving Two-Party Relationship Test Protocol

Hanzhou Wang, Dongyu Li, Zhenyu Guan, Yizhong Liu, Jianwei Liu

Summary: This paper introduces a framework to protect privacy in multi-agent systems, allowing the states to reach a consensus while keeping the initial states confidential. A privacy-preserving two-party relationship test protocol is proposed, which is then used to devise average consensus and rendezvous controllers for first- and second-order systems. Unlike previous research that relies on stochastic coupling weights, our approach overcomes the random chattering problem in control input, leading to improved convergence performance. Numerical verification is conducted to demonstrate the effectiveness of the proposed controllers.

IEEE CONTROL SYSTEMS LETTERS (2023)

Article Computer Science, Theory & Methods

A Flexible Sharding Blockchain Protocol Based on Cross-Shard Byzantine Fault Tolerance

Yizhong Liu, Xinxin Xing, Haosu Cheng, Dawei Li, Zhenyu Guan, Jianwei Liu, Qianhong Wu

Summary: This paper proposes a flexible sharding (FS) blockchain protocol that addresses the drawbacks of existing sharding blockchain schemes through the design of a cross-shard Byzantine fault tolerance protocol, multiple parallel CSBFT, a defense mechanism against cross-shard transaction censorship attacks, and a secure and decentralized shard reconfiguration method. The paper also provides a formal protocol design method and security proofs for each protocol. The evaluation shows that FS has lower complexity and achieves significant performance improvements.

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2023)

Article Engineering, Civil

Intelligent and Fair IoV Charging Service Based on Blockchain With Cross-Area Consensus

Dawei Li, Ruonan Chen, Qinjun Wan, Zhenyu Guan, Shizhong Li, Min Xie, Jieyu Su, Jianwei Liu

Summary: The emergence of electric vehicles has led to the development of the Internet of Vehicles (IoV), but there are various security issues that need to be addressed. This paper proposes a blockchain-based intelligent and fair IoV charging service system that intelligently recommends charging piles for vehicles and ensures fairness between charging and payment through a payment channel protocol.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Engineering, Electrical & Electronic

New Finding and Unified Framework for Fake Image Detection

Xin Deng, Bihe Zhao, Zhenyu Guan, Mai Xu

Summary: Recently, fake face images generated by generative adversarial network (GAN) have become a major concern in social networks due to the security risks they pose. This study reveals that GAN generated fake images have stronger non-local self-similarity than real images, leading to the development of NAFID, a non-local attention based fake image detection network. Experimental results demonstrate the superiority of NAFID over state-of-the-art face forgery detection methods and the potential to improve the detection accuracy of other models.

IEEE SIGNAL PROCESSING LETTERS (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Implicit Neural Representations for Image Compression

Yannick Strumpler, Janis Postels, Ren Yang, Luc Van Gool, Federico Tombari

Summary: This study introduces a compression method based on Implicit Neural Representations (INRs), which significantly improves compression quality and outperforms traditional algorithms through meta-learned initializations and network structure improvements.

COMPUTER VISION, ECCV 2022, PT XXVI (2022)

Proceedings Paper Computer Science, Theory & Methods

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

Ren Yang, Radu Timofte, Meisong Zheng, Qunliang Xing, Minglang Qiao, Mai Xu, Lai Jiang, Huaida Liu, Ying Chen, Youcheng Ben, Xiao Zhou, Chen Fu, Pei Cheng, Gang Yu, Junyi Li, Renlong Wu, Zhilu Zhang, Wei Shang, Zhengyao Lv, Yunjin Chen, Mingcai Zhou, Dongwei Ren, Kai Zhang, Wangmeng Zuo, Pavel Ostyakov, Vyal Dmitry, Shakarim Soltanayev, Chervontsev Sergey, Zhussip Magauiya, Xueyi Zou, Youliang Yan, Pablo Navarrete Michelini, Yunhua Lu, Diankai Zhang, Shaoli Liu, Si Gao, Biao Wu, Chengjian Zheng, Xiaofeng Zhang, Kaidi Lu, Ning Wang, Thuong Nguyen Canh, Thong Bach, Qing Wang, Xiaopeng Sun, Haoyu Ma, Shijie Zhao, Junlin Li, Liangbin Xie, Shuwei Shi, Yujiu Yang, Xintao Wang, Jinjin Gu, Chao Dong, Xiaodi Shi, Chunmei Nian, Dong Jiang, Jucai Lin, Zhihuai Xie, Mao Ye, Dengyan Luo, Liuhan Peng, Shengjie Chen, Xin Liu, Qian Wang, Boyang Liang, Hang Dong, Yuhao Huang, Kai Chen, Xingbei Guo, Yujing Sun, Huilei Wu, Pengxu Wei, Yulin Huang, Junying Chen, Ik Hyun Lee, Sunder Ali Khowaja, Jiseok Yoon

Summary: This paper reviews the NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video, introducing the dataset, tracks, participating teams, and final results. The challenge evaluates the state-of-the-art techniques in super-resolution and quality enhancement of compressed video, providing relevant datasets and code resources.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022 (2022)
