Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jure Žbontar

Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

May 18, 2016

Jure Žbontar, Yann LeCun

Figure 1 for Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Figure 2 for Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Figure 3 for Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Figure 4 for Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches

Abstract:We present a method for extracting depth information from a rectified image pair. Our approach focuses on the first stage of many stereo algorithms: the matching cost computation. We approach the problem by learning a similarity measure on small image patches using a convolutional neural network. Training is carried out in a supervised manner by constructing a binary classification data set with examples of similar and dissimilar pairs of patches. We examine two network architectures for this task: one tuned for speed, the other for accuracy. The output of the convolutional neural network is used to initialize the stereo matching cost. A series of post-processing steps follow: cross-based cost aggregation, semiglobal matching, a left-right consistency check, subpixel enhancement, a median filter, and a bilateral filter. We evaluate our method on the KITTI 2012, KITTI 2015, and Middlebury stereo data sets and show that it outperforms other approaches on all three data sets.

* JMLR 17(65):1-32, 2016

Via

Access Paper or Ask Questions

Computing the Stereo Matching Cost with a Convolutional Neural Network

Oct 20, 2015

Jure Žbontar, Yann LeCun

Figure 1 for Computing the Stereo Matching Cost with a Convolutional Neural Network

Figure 2 for Computing the Stereo Matching Cost with a Convolutional Neural Network

Figure 3 for Computing the Stereo Matching Cost with a Convolutional Neural Network

Figure 4 for Computing the Stereo Matching Cost with a Convolutional Neural Network

Abstract:We present a method for extracting depth information from a rectified image pair. We train a convolutional neural network to predict how well two image patches match and use it to compute the stereo matching cost. The cost is refined by cross-based cost aggregation and semiglobal matching, followed by a left-right consistency check to eliminate errors in the occluded regions. Our stereo method achieves an error rate of 2.61 % on the KITTI stereo dataset and is currently (August 2014) the top performing method on this dataset.

* Conference on Computer Vision and Pattern Recognition (CVPR), June 2015

Via

Access Paper or Ask Questions