1. School of Electrical Engineering and Automation, Hefei University of Technology , Hefei China 230009
2. Orbbec, Nanshan, Shenzhen City, Guangdong Province, China , Shenzhen China 518000
3. School of Electronics and Communication Engineering, Sun Yat-sen University, Guangzhou , Guangzhou China 510006
4. School of Aeronautics and Astronautics, Sun Yat-sen University , Guangzhou China 510006
Deep neural networks have shown great success in stereo matching in recent years. On the KITTI datasets, most top performing methods are based on neural networks. However, on the Middlebury datasets, these methods usually do not perform well. The KITTI datasets were collected in outdoor scenes while the Middlebury datasets were collected in indoor scenes. It is commonly believed that the community still lacks a large labelled dataset for stereo matching in indoor scenes. In this paper, we introduce a new stereo dataset called InStereo2K. It contains 2050 pairs of stereo images with highly accurate groundtruth disparity maps, including 2000 pairs for training and 50 pairs for test. Experimental results show that our dataset can significantly improve the performance of several latest networks (including StereoNet and PSMNet) on the Middlebury 2014 dataset. The large scale, high accuracy and rich diversity of the proposed InStereo2K dataset provides new opportunities to researchers in the area of stereo matching and beyond. It also takes end-to-end stereo matching methods a step towards practical applications. The dataset is available at https://github.com/yuhuaxu/stereodataset.
This work was supported by National Natural Science Foundation of China (Grant Nos. 61402489, 61972435, 61602499) and Fundamental Research Funds for the Central Universities (No. 18lgzd06).
Copyright 2020 CHINA SCIENCE PUBLISHING & MEDIA LTD. 中国科技出版传媒股份有限公司 版权所有