Article

Temporal capsule networks for video motion estimation and error concealment

Journal

SIGNAL IMAGE AND VIDEO PROCESSING
Volume 14, Issue 7, Pages 1369-1377

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s11760-020-01671-x

Keywords

Capsule networks; Conv3D; ConvLSTM; Error concealment; Motion estimation

In this paper, we present a temporal capsule network architecture that encodes motion in videos as an instantiation parameter. The extracted motion is used to perform motion-compensated error concealment. We modify the original capsule network architecture and use a carefully curated dataset so that the capsules can be trained both spatially and temporally. First, we add the temporal dimension by taking co-located patches from three consecutive frames of standard video sequences to form input data cubes. Second, the network is designed with an initial feature extraction layer that operates on all three dimensions to generate spatiotemporal features. Additionally, we implement the PrimaryCaps module with a recurrent layer, instead of a conventional convolutional layer, to extract short-term motion-related temporal dependencies and encode them as activation vectors in the capsule output. Finally, the capsule output is combined with the most recent past frame and passed through a fully connected reconstruction network to perform motion-compensated error concealment. We study the effectiveness of temporal capsules by comparing the proposed model with architectures that do not include capsules. Although the reconstruction quality leaves room for improvement, we successfully demonstrate that capsule-based architectures can be designed to operate in the temporal dimension and to encode motion-related attributes as instantiation parameters. The accuracy of motion estimation is evaluated by comparing both the reconstructed frame outputs and the corresponding optical flow estimates with ground-truth data.
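The pipeline described in the abstract maps naturally onto standard deep-learning layers. The following is a minimal sketch in TensorFlow/Keras, not the authors' implementation: the patch size, filter counts, capsule count and dimension, and reconstruction-network widths are all illustrative assumptions, and the squash non-linearity is the standard one from the original capsule network formulation (Sabour et al., 2017).

```python
# Hypothetical sketch of the temporal capsule pipeline described in the
# abstract. All sizes below are illustrative assumptions, not the
# authors' actual configuration.
import tensorflow as tf
from tensorflow.keras import layers, Model

PATCH = 32                 # assumed spatial patch size
FRAMES = 3                 # co-located patches from three consecutive frames
N_CAPS, CAPS_DIM = 32, 8   # assumed capsule count and vector length

def squash(v, axis=-1):
    """Standard capsule squashing non-linearity (Sabour et al., 2017)."""
    sq = tf.reduce_sum(tf.square(v), axis=axis, keepdims=True)
    return (sq / (1.0 + sq)) * v / tf.sqrt(sq + 1e-8)

# Input: a data cube of three co-located grayscale patches, plus the
# most recent past frame patch used later for motion compensation.
cube = layers.Input(shape=(FRAMES, PATCH, PATCH, 1), name="data_cube")
past = layers.Input(shape=(PATCH, PATCH, 1), name="past_frame")

# 1) Spatiotemporal feature extraction: Conv3D over all three dimensions,
#    keeping the temporal axis so the recurrent stage still sees a sequence.
feat = layers.Conv3D(64, kernel_size=(3, 9, 9), padding="same",
                     activation="relu")(cube)

# 2) PrimaryCaps via a recurrent layer: ConvLSTM2D consumes the 3-step
#    sequence and emits one activation map encoding short-term motion.
prim = layers.ConvLSTM2D(N_CAPS * CAPS_DIM, kernel_size=(5, 5),
                         strides=2, padding="same",
                         return_sequences=False)(feat)

# Group the channels into capsule vectors and apply the squash
# non-linearity so that vector length can act as an activation probability.
caps = layers.Reshape((-1, CAPS_DIM))(prim)
caps = layers.Lambda(squash, name="capsules")(caps)

# 3) Motion-compensated error concealment: combine the capsule output with
#    the most recent past frame and reconstruct the concealed patch with a
#    fully connected network.
merged = layers.Concatenate()([layers.Flatten()(caps),
                               layers.Flatten()(past)])
fc = layers.Dense(512, activation="relu")(merged)
fc = layers.Dense(PATCH * PATCH, activation="sigmoid")(fc)
recon = layers.Reshape((PATCH, PATCH, 1), name="concealed_patch")(fc)

model = Model(inputs=[cube, past], outputs=recon)
model.compile(optimizer="adam", loss="mse")  # assumed reconstruction loss
```

Using padding="same" on the temporal axis of the Conv3D stage is one way to preserve a three-step sequence for the ConvLSTM-based PrimaryCaps layer to consume; the paper's actual layer configuration may differ.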
