Abstract
We propose an encoder–decoder with densely convolutional networks model to recover the depth information from a single RGB image without the need for depth sensors. The encoder part serves to extract the most representative information from the original data through a series of convolution operations and to reduce the resolution of the spatial input feature. We use the decoder section to produce an upsampling structure that improves the output resolution. Our model is trained from scratch, without any special tuning process, and uses a new optimization function to adaptively learn the rate. We demonstrate the effectiveness of the method by evaluating both indoor and outdoor scenes, and the experimental results show that our proposed approach is more accurate than competing methods.
© 2019 Optical Society of America
Full Article | PDF ArticleMore Like This
Shiyuan Liu, Jingfan Fan, Dengpan Song, Tianyu Fu, Yucong Lin, Deqiang Xiao, Hong Song, Yongtian Wang, and Jian Yang
Biomed. Opt. Express 13(5) 2707-2727 (2022)
Huachun Wang, Xinzhu Sang, Duo Chen, Peng Wang, Xiaoqian Ye, Shuai Qi, and Binbin Yan
Appl. Opt. 61(7) D7-D14 (2022)
Xiao Liang, Jingshuang Sun, Xuewei Wang, Jie Li, Lianpeng Zhang, and Jingbo Guo
J. Opt. Soc. Am. A 40(6) 1237-1248 (2023)