☆ 4.6 Article

Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation

COGNITIVE COMPUTATION (2020)

期刊

COGNITIVE COMPUTATION

卷 12, 期 4, 页码 834-843

出版社

SPRINGER

DOI: 10.1007/s12559-020-09717-5

关键词

Muller-Lyer illusion; Cognitive template-clustering; Brain-inspired computation; LineMod; 6D pose estimation

类别

Computer Science, Artificial Intelligence Neurosciences

资金

Beijing Natural Science Foundation [4184103]
National Natural Science Foundation of China [61806195]
Strategic Priority Research Program of Chinese Academy of Sciences [XDB32070100]
Beijing Municipality of Science and Technology [Z181100001518006]
CETC Joint Fund [6141B08010103]
Beijing Academy of Artificial Intelligence (BAAI)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Various types of theoretical algorithms have been proposed for 6D pose estimation, e.g., the point pair method, template matching method, Hough forest method, and deep learning method. However, they are still far from the performance of our natural biological systems, which can undertake 6D pose estimation of multi-objects efficiently, especially with severe occlusion. With the inspiration of the Muller-Lyer illusion in the biological visual system, in this paper, we propose a cognitive template-clustering improved LineMod (CT-LineMod) model. The model uses a 7D cognitive feature vector to replace standard 3D spatial points in the clustering procedure of Patch-LineMod, in which the cognitive distance of different 3D spatial points will be further influenced by the additional 4D information related with direction and magnitude of features in the Muller-Lyer illusion. The 7D vector will be dimensionally reduced into the 3D vector by the gradient-descent method, and then further clustered by K-means to aggregately match templates and automatically eliminate superfluous clusters, which makes the template matching possible on both holistic and part-based scales. The model has been verified on the standard Doumanoglou dataset and demonstrates a state-of-the-art performance, which shows the accuracy and efficiency of the proposed model on cognitive feature distance measurement and template selection on multiple pose estimation under severe occlusion. The powerful feature representation in the biological visual system also includes characteristics of the Muller-Lyer illusion, which, to some extent, will provide guidance towards a biologically plausible algorithm for efficient 6D pose estimation under severe occlusion.

Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation

期刊

COGNITIVE COMPUTATION

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Cognitive Template-Clustering Improved LineMod for Efficient Multi-object Pose Estimation

期刊

COGNITIVE COMPUTATION

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文