4.7 Article

Scene Understanding by Reasoning Stability and Safety

期刊

INTERNATIONAL JOURNAL OF COMPUTER VISION
卷 112, 期 2, 页码 221-238

出版社

SPRINGER
DOI: 10.1007/s11263-014-0795-4

关键词

-

资金

  1. MURI ONR [N00014-10-1-0933]
  2. DARPA MSEE, USA [FA 8650-11-1-7149]
  3. Next-generation Energies for Tohoku Recovery (NET)
  4. SCOPE Program of Ministry of Internal Affairs and Communications, Japan
  5. Microsoft Japan
  6. Direct For Computer & Info Scie & Enginr
  7. Div Of Information & Intelligent Systems [1423305] Funding Source: National Science Foundation

向作者/读者索取更多资源

This paper presents a new perspective for 3D scene understanding by reasoning object stability and safety using intuitive mechanics. Our approach utilizes a simple observation that, by human design, objects in static scenes should be stable in the gravity field and be safe with respect to various physical disturbances such as human activities. This assumption is applicable to all scene categories and poses useful constraints for the plausible interpretations (parses) in scene understanding. Given a 3D point cloud captured for a static scene by depth cameras, our method consists of three steps: (i) recovering solid 3D volumetric primitives from voxels; (ii) reasoning stability by grouping the unstable primitives to physically stable objects by optimizing the stability and the scene prior; and (iii) reasoning safety by evaluating the physical risks for objects under physical disturbances, such as human activity, wind or earthquakes. We adopt a novel intuitive physics model and represent the energy landscape of each primitive and object in the scene by a disconnectivity graph (DG). We construct a contact graph with nodes being 3D volumetric primitives and edges representing the supporting relations. Then we adopt a Swendson-Wang Cuts algorithm to partition the contact graph into groups, each of which is a stable object. In order to detect unsafe objects in a static scene, our method further infers hidden and situated causes (disturbances) in the scene, and then introduces intuitive physical mechanics to predict possible effects (e.g., falls) as consequences of the disturbances. In experiments, we demonstrate that the algorithm achieves a substantially better performance for (i) object segmentation, (ii) 3D volumetric recovery, and (iii) scene understanding with respect to other state-of-the-art methods. We also compare the safety prediction from the intuitive mechanics model with human judgement.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据