☆ 4.6 Article

Exploring Deep Learning for View-Based 3D Model Retrieval

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2020)

期刊

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS

卷 16, 期 1, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3377876

关键词

3D model retrieval; benchmark; deep learning features; handcrafted feature

类别

Computer Science, Information Systems Computer Science, Software Engineering Computer Science, Theory & Methods

资金

National Natural Science Foundation of China [61872270, 61572357]
National Key R&D Program of China [2019YFBB1404700]
Jinan's innovation team [2018GXRC014]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

In recent years, view-based 3D model retrieval has become one of the research focuses in the field of computer vision and machine learning. In fact, the 3D model retrieval algorithm consists of feature extraction and similarity measurement, and the robust features play a decisive role in the similarity measurement. Although deep learning has achieved comprehensive success in the field of computer vision, deep learning features are used for 3D model retrieval only in a small number of works. To the best of our knowledge, there is no benchmark to evaluate these deep learning features. To tackle this problem, in this work we systematically evaluate the performance of deep learning features in view-based 3D model retrieval on four popular datasets (ETH, NTU60, PSB, and MVRED) by different kinds of similarity measure methods. In detail, the performance of hand-crafted features and deep learning features are compared, and then the robustness of deep learning features is assessed. Finally, the difference between single-view deep learning features and multi-view deep learning features is also evaluated. By quantitatively analyzing the performances on different datasets, it is clear that these deep learning features can consistently outperform all of the hand-crafted features, and they are also more robust than the hand-crafted features when different degrees of noise are added into the image. The exploration of latent relationships among different views in multi-view deep learning network architectures shows that the performance of multi-view deep learning outperforms that of single-view deep learning features with low computational complexity.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.6

评分不足

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

Image Matching from Handcrafted to Deep Features: A Survey

Jiayi Ma, Xingyu Jiang, Aoxiang Fan, Junjun Jiang, Junchi Yan

Summary: Image matching is a fundamental task in various visual applications, and with the development of deep learning techniques, there has been an increasing number of methods proposed in this field. However, the challenge remains in choosing the suitable method for specific applications and designing image matching methods with superior performance. This comprehensive review and analysis provide insights into classical and latest techniques, and offer prospects for future development in image matching technologies.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2021)