Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2022, Vol. 43 ›› Issue (1): 53-59.DOI: 10.11996/JG.j.2095-302X.2022010053

• Image Processing and Computer Vision • Previous Articles     Next Articles

Video super-resolution reconstruction based on multi-scale time domain 3D convolution 

  

  1. 1. Qian Xuesen Space Technology Laboratory, Beijing 100086, China;  2. College of Software, Henan University, Kaifeng Henan 475004, China;  3. Department of Electronic Engineering, Tsinghua University, Beijing 100084, China 
  • Online:2022-02-28 Published:2022-02-16
  • Supported by:
    Key R&D Program of the Ministry of Science and Technology (2020YFA0714100) 

Abstract: Video super-resolution was a work of great practical value. In view of the lack of high-resolution resources in the ultra-high-definition industry, to efficiently utilize the rich temporal correlation information and spatial information between video sequence frames, a video super-resolution reconstruction algorithm based on multi-scale time-domain 3D convolution was proposed. The algorithm extracted the spatiotemporal features of the input low-resolution video sequence frames through the 3D convolution of different time scales. 3D convolution can simultaneously model space and time, which is more suitable for processing video tasks than 2D convolution. After the adaptive motion compensation of two spatio-temporal features extracted in different scales and time domains, the sub-pixel convolutional layer performed resolution enhancement, which was added to the up-sampled input frame to obtain the final reconstructed high-resolution image. The experimental results on the standard data set show that the algorithm can significantly boost visual effects and objective quality evaluation indicators such as peak signal-to-noise ratio and structural similarity, outperforming algorithms such as FSRCNN and EDSR. 

Key words: video super-resolution, deep learning, 3D convolution, multi-scale time domain features, sub-pixel convolution 

CLC Number: