4.7 Review

Advances in Video Compression System Using Deep Neural Network: A Review and Case Studies

Journal

PROCEEDINGS OF THE IEEE
Volume 109, Issue 9, Pages 1494-1520

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JPROC.2021.3059994

Keywords

Encoding; Video compression; Video coding; Streaming media; Visualization; Quality of experience; Spatiotemporal phenomena; Deep learning; Neural networks; Adaptive filters; deep neural networks (DNNs); neural video coding; texture analysis

Ask authors/readers for more resources

In recent years, significant advances have been made in video compression systems, and artificial intelligence technology shows great potential to further increase efficiency in this field. Preprocessing, coding, and postprocessing are identified as key functional blocks that have been continuously investigated to enhance user experience.
Significant advances in video compression systems have been made in the past several decades to satisfy the near-exponential growth of Internet-scale video traffic. From the application perspective, we have identified three major functional blocks, including preprocessing, coding, and postprocessing, which have been continuously investigated to maximize the end-user quality of experience (QoE) under a limited bit rate budget. Recently, artificial intelligence (AI)-powered techniques have shown great potential to further increase the efficiency of the aforementioned functional blocks, both individually and jointly. In this article, we review recent technical advances in video compression systems extensively, with an emphasis on deep neural network (DNN)-based approaches, and then present three comprehensive case studies. On preprocessing, we show a switchable texture-based video coding example that leverages DNN-based scene understanding to extract semantic areas for the improvement of a subsequent video coder. On coding, we present an end-to-end neural video coding framework that takes advantage of the stacked DNNs to efficiently and compactly code input raw videos via fully data-driven learning. On postprocessing, we demonstrate two neural adaptive filters to, respectively, facilitate the in-loop and postfiltering for the enhancement of compressed frames. Finally, a companion website hosting the contents developed in this work can be accessed publicly at https://purdueviper.github.io/dnn-coding/.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available