4.7 Review

Advances in Video Compression System Using Deep Neural Network: A Review and Case Studies

期刊

PROCEEDINGS OF THE IEEE
卷 109, 期 9, 页码 1494-1520

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JPROC.2021.3059994

关键词

Encoding; Video compression; Video coding; Streaming media; Visualization; Quality of experience; Spatiotemporal phenomena; Deep learning; Neural networks; Adaptive filters; deep neural networks (DNNs); neural video coding; texture analysis

向作者/读者索取更多资源

In recent years, significant advances have been made in video compression systems, and artificial intelligence technology shows great potential to further increase efficiency in this field. Preprocessing, coding, and postprocessing are identified as key functional blocks that have been continuously investigated to enhance user experience.
Significant advances in video compression systems have been made in the past several decades to satisfy the near-exponential growth of Internet-scale video traffic. From the application perspective, we have identified three major functional blocks, including preprocessing, coding, and postprocessing, which have been continuously investigated to maximize the end-user quality of experience (QoE) under a limited bit rate budget. Recently, artificial intelligence (AI)-powered techniques have shown great potential to further increase the efficiency of the aforementioned functional blocks, both individually and jointly. In this article, we review recent technical advances in video compression systems extensively, with an emphasis on deep neural network (DNN)-based approaches, and then present three comprehensive case studies. On preprocessing, we show a switchable texture-based video coding example that leverages DNN-based scene understanding to extract semantic areas for the improvement of a subsequent video coder. On coding, we present an end-to-end neural video coding framework that takes advantage of the stacked DNNs to efficiently and compactly code input raw videos via fully data-driven learning. On postprocessing, we demonstrate two neural adaptive filters to, respectively, facilitate the in-loop and postfiltering for the enhancement of compressed frames. Finally, a companion website hosting the contents developed in this work can be accessed publicly at https://purdueviper.github.io/dnn-coding/.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据