4.4 Article

Depth estimation for advancing intelligent transport systems based on self-improving pyramid stereo network

Journal

IET INTELLIGENT TRANSPORT SYSTEMS
Volume 14, Issue 5, Pages 338-345

Publisher

WILEY
DOI: 10.1049/iet-its.2019.0462

Keywords

computer vision; stereo image processing; neural nets; learning (artificial intelligence); intelligent transport systems; pyramid stereo network; autonomous driving; stereo vision-based depth estimation technology; stereo depth estimation problem; deep learning model; convolutional neural networks; strong adaptive capabilities; ground truth depth; training data; complicated post-processing; ill-posed area; online learning; data limitation problem

Funding

  1. National Key Research and Development Program of China [2018YFB1308000]
  2. National Natural Science Funds of China [U1913202, U1813205, U1713213, 61772508, 61907009]
  3. Natural Science Foundation of Guangdong Province [2018A030313802]
  4. Shenzhen Technology Project [JCYJ20180507182610734, JCYJ20170413152535587]
  5. CAS Key Technology Talent Program

Ask authors/readers for more resources

In autonomous driving, stereo vision-based depth estimation technology can help to estimate the distance of obstacles accurately, which is crucial for correctly planning the path of the vehicle. Recent work has formulated the stereo depth estimation problem into a deep learning model with convolutional neural networks. However, these methods need a lot of post-processing and do not have strong adaptive capabilities to ill-posed regions or new scenes. In addition, due to the difficulty of the labelling the ground truth depth for real circumstance, training data for the system is limited. To overcome the above problems, the authors came up with self-improving pyramid stereo network, which can not only get a direct regression disparity without complicated post-processing but also be robust in ill-posed area. Moreover, by online learning, the proposed model can not only address the data limitation problem but also save the time spent on training and hardware resources in practice. At the same time, the proposed model has a self-improving ability to new scenes, which can quickly adjust the model according to the test data in time and improve the accuracy of prediction. Experiments on Scene Flow and KITTI data set demonstrate the effectiveness of the proposed network.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available