Single Image Depth Estimation Based on Encoder-Decoder Convolution Neural Network

doi:10.11996/JG.j.2095-302X.2019040718

Journal of Graphics

Previous Articles Next Articles

Single Image Depth Estimation Based on Encoder-Decoder Convolution Neural Network

(School of Information Science and Technology, North China University of Technology, Beijing 100144, China)

Online:2019-08-31 Published:2019-08-30

Abstract

Abstract: Abstract: Focusing on the poor robustness and lower accuracy in traditional methods of estimating depth in monocular vision, a method based on convolution neural network (CNN) is proposed for predicting depth from a single image. At first, fused-layers encoder-decoder network is presented. This network is an improvement of the end-to-end encoder-decoder network structure. Fused-layers block is added to encoder network, and the network utilization of multi-scale information is improved by this block with fusing multi-layers feature. Then, a multi-receptive field res-block is proposed, which is the main component of the decoder and used for estimating depth from high-level semantic information. Meanwhile, the network capacity of multi-scale feature extraction is enhanced because the size of receptive field is flexible to change in multi-receptive field res-block. The validation of proposed network is conducted on NYUD v2 dataset, and compared with multi-scale convolution neural network, experimental results show that the accuracy of proposed method is improved by about 4.4% in δ<1.25 and average relative error is reduced by about 8.2%. The feasibility of proposed method in estimating depth from a single image is proved.

Key words: Keywords: CNN, encoder-decoder, depth estimation, monocular vision

JIA Rui-ming, LIU Li-qiang, LIU Sheng-jie, CUI Jia-li . Single Image Depth Estimation Based on Encoder-Decoder Convolution Neural Network[J]. Journal of Graphics, DOI: 10.11996/JG.j.2095-302X.2019040718.

[1]	LIAO Zhi-wei , JIN Jing , ZHANG Chao-fan, YANG Xue-zhi. Monocular depth estimation of ASPP networks based on hierarchical compress excitation [J]. Journal of Graphics, 2022, 43(2): 214-222.
[2]	MU Qi , , ZHANG Han , HE Zhi-qiang , LI Zhan-li . Scale adaptive target tracking algorithm based on depth estimation and#br# feature fusion [J]. Journal of Graphics, 2021, 42(4): 563-571.
[3]	JIANG Su-qin , ZHANG Meng-jun , LI Wei-qing , SU Zhi-yong . Real-time virtual and real occlusion processing technology based on voting decision [J]. Journal of Graphics, 2021, 42(4): 629-635.
[4]	HE Ye, ZHANG Xu-dong, WU Di. FANET: light field depth estimation with multi-channel information fusion [J]. Journal of Graphics, 2020, 41(6): 922-929.
[5]	WEN Jing, AN Guo-yan, LIANG Yu-dong . Monocular Image Depth Estimation Based on CNN Features Extraction and Weighted Transfer Learning [J]. Journal of Graphics, 2019, 40(2): 248-255.

Single Image Depth Estimation Based on Encoder-Decoder Convolution Neural Network

PDF (PC)

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 5

Recommended Articles

Metrics

Comments