4.7 Article

Extracting full-field subpixel structural displacements from videos via deep learning

期刊

JOURNAL OF SOUND AND VIBRATION
卷 505, 期 -, 页码 -

出版社

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jsv.2021.116142

关键词

Displacement measurement; Video camera; Phase-based displacement extraction; Convolution neural networks; Subpixel motion field

资金

  1. American Association of Railroads [20-0701-007532, AWD000 02881 (7152981)]
  2. Physics of AI program of Defense Advanced Research Projects Agency (DARPA)

向作者/读者索取更多资源

This study developed a deep learning framework based on convolutional neural networks for real-time extraction of full-field subpixel structural displacements from videos, overcoming the limitations of traditional displacement sensing techniques. Results showed that the trained networks have generalizability to accurately extract full-field subpixel displacements for pixels with sufficient texture contrast.
Conventional displacement sensing techniques (e.g., laser, linear variable differential transformer) have been widely used in structural health monitoring in the past two decades. Though these techniques are capable of measuring displacement time histories with high accuracy, distinct shortcoming remains such as point-to-point contact sensing which limits its applicability in real-world problems. Video cameras have been widely used in the past years due to advantages that include low price, agility, high spatial sensing resolution, and non-contact. Compared with target tracking approaches (e.g., digital image correlation, template matching, etc.), the phase-based method is powerful for detecting small subpixel motions without the use of paints or markers on the structure surface. Nevertheless, the complex computational procedure limits its real-time inference capacity. To address this fundamental issue, we develop a deep learning framework based on convolutional neural networks (CNNs) that enable real-time extraction of full-field subpixel structural displacements from videos. In particular, two new CNN architectures are designed and trained on a dataset generated by the phase-based motion extraction method from a single lab-recorded high-speed video of a dynamic structure. As displacement is only reliable in the regions with sufficient texture contrast, the sparsity of motion field induced by the texture mask is considered via the network architecture design and loss function definition. Results show that, with the supervision of full and sparse motion field, the trained network is capable of identifying the pixels with sufficient texture contrast as well as their subpixel motions. The performance of the trained networks is tested on various videos of other structures to extract the full-field motion (e.g., displacement time histories), which indicates that the trained networks have generalizability to accurately extract full-field subpixel displacements for pixels with sufficient texture contrast. (c) 2021 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据