SCIENCE CHINA Information Sciences, https://doi.org/10.1007/s11432-019-2803-x

InStereo2K: A large real dataset for stereo matching in indoor scenes

More info


Deep neural networks have shown great success in stereo matching in recent years. On the KITTI datasets, most top performing methods are based on neural networks. However, on the Middlebury datasets, these methods usually do not perform well. The KITTI datasets were collected in outdoor scenes while the Middlebury datasets were collected in indoor scenes. It is commonly believed that the community still lacks a large labelled dataset for stereo matching in indoor scenes. In this paper, we introduce a new stereo dataset called InStereo2K. It contains 2050 pairs of stereo images with highly accurate groundtruth disparity maps, including 2000 pairs for training and 50 pairs for test. Experimental results show that our dataset can significantly improve the performance of several latest networks (including StereoNet and PSMNet) on the Middlebury 2014 dataset. The large scale, high accuracy and rich diversity of the proposed InStereo2K dataset provides new opportunities to researchers in the area of stereo matching and beyond. It also takes end-to-end stereo matching methods a step towards practical applications. The dataset is available at https://github.com/yuhuaxu/stereodataset.

Funded by

This work was supported by National Natural Science Foundation of China (Grant Nos. 61402489, 61972435, 61602499) and Fundamental Research Funds for the Central Universities (No. 18lgzd06).

Copyright 2020  CHINA SCIENCE PUBLISHING & MEDIA LTD.  中国科技出版传媒股份有限公司  版权所有

京ICP备14028887号-23       京公网安备11010102003388号