A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

Volume 28, Issue 11

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2019.2921877

Keywords

High efficiency video coding; in-loop filter; deep learning; multiple frames

Funding

National Natural Science Foundation of China (NSFC) [61876013, 61573037]
Fok Ying Tung Education Foundation [151061]

Abstract

Extensive studies on in-loop filtering have been conducted for the High Efficiency Video Coding (HEVC) standard, aiming to reduce compression artifacts and thereby improve coding efficiency. However, existing approaches always apply the in-loop filter to each frame individually, without exploiting the content correlation among multiple frames. In this paper, we propose a multi-frame in-loop filter (MIF) for HEVC, which enhances the visual quality of each encoded frame by leveraging its adjacent frames. Specifically, we first construct a large-scale database containing encoded frames and their corresponding raw frames covering a variety of content, which can be used to learn the in-loop filter in HEVC. Furthermore, we find that, for an encoded frame, there usually exist a number of reference frames of higher quality and similar content. Accordingly, a reference frame selector (RFS) is designed to identify these frames. Then, a deep neural network for MIF (known as MIF-Net) is developed to enhance the quality of each encoded frame by utilizing the spatial information of this frame and the temporal information of its neighboring higher-quality frames. The MIF-Net is built on the recently developed DenseNet, benefiting from its improved generalization capacity and computational efficiency. In addition, a novel block-adaptive convolutional layer is designed and applied in the MIF-Net to handle the artifacts influenced by the coding tree unit (CTU) structure in HEVC. Extensive experiments show that our MIF approach achieves an average Bjøntegaard delta bit-rate (BD-BR) saving of 11.621% on the standard test set, significantly outperforming the standard in-loop filter in HEVC and other state-of-the-art approaches.
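
The BD-BR figure quoted in the abstract is a standard way of comparing two rate-distortion curves. The following is a minimal sketch of how such a number is commonly computed, assuming the usual cubic-fit formulation (log bit-rate fitted as a cubic polynomial of PSNR, then averaged over the overlapping PSNR range). The function name `bd_rate` and the sample rate/PSNR values are illustrative assumptions, not taken from the paper or its evaluation.

```python
import numpy as np

def bd_rate(rate_anchor, psnr_anchor, rate_test, psnr_test):
    """Bjontegaard delta bit-rate (in percent) of a test codec vs. an anchor.

    Hypothetical helper for illustration: a negative result means the test
    codec needs fewer bits than the anchor at equal PSNR.
    """
    log_ra = np.log(np.asarray(rate_anchor, dtype=float))
    log_rt = np.log(np.asarray(rate_test, dtype=float))
    pa = np.asarray(psnr_anchor, dtype=float)
    pt = np.asarray(psnr_test, dtype=float)

    # Fit log(rate) as a cubic polynomial of PSNR for each curve.
    fit_a = np.polyfit(pa, log_ra, 3)
    fit_t = np.polyfit(pt, log_rt, 3)

    # Integrate both fits over the PSNR interval the two curves share.
    lo = max(pa.min(), pt.min())
    hi = min(pa.max(), pt.max())
    int_a = np.polyval(np.polyint(fit_a), hi) - np.polyval(np.polyint(fit_a), lo)
    int_t = np.polyval(np.polyint(fit_t), hi) - np.polyval(np.polyint(fit_t), lo)

    # Average log-rate difference, converted back to a percentage.
    avg_diff = (int_t - int_a) / (hi - lo)
    return (np.exp(avg_diff) - 1.0) * 100.0

# Made-up sample points: four rate (kbps) / PSNR (dB) pairs per curve.
anchor_rate = [1000.0, 2000.0, 4000.0, 8000.0]
anchor_psnr = [32.0, 35.0, 38.0, 41.0]
test_rate = [0.9 * r for r in anchor_rate]  # same quality at 10% fewer bits
result = bd_rate(anchor_rate, anchor_psnr, test_rate, anchor_psnr)
```

Under this sign convention, a saving such as the one reported in the abstract corresponds to a negative BD-BR value; in the made-up example the test curve uses 10% fewer bits at every quality level, so `result` is approximately -10.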

NR-CNN: Nested-Residual Guided CNN In-loop Filtering for Video Coding

Kai Lin, Chuanmin Jia, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

Summary: This article introduces a new deep learning network structure for in-loop filtering in video coding. The proposed method utilizes the correlation between different color components and achieves improvements in both performance and efficiency through techniques such as adaptive granularity optimization and parallel inference pipeline.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2022)