7 February 2019 Structure-preserving video super-resolution using three-dimensional convolutional neural networks
Author Affiliations +
Abstract
Convolutional neural networks (CNN) have given rise to a new generation of video super-resolution (SR) technique. However, most existing CNN-based video SR algorithms treat the consecutive frames as a series of feature maps, just as the procedure performed in single image SR algorithms. We propose an end-to-end three-dimensional (3-D) CNN video SR framework. The input frames are considered as a cube in our framework. 3-D convolution is performed on it to extract features along spatial and temporal dimension. Image prior knowledge, such as optical flows, is introduced in reconstruction. A combination of mean square error loss and multiscale structure similarity index (MS-SSIM) loss is used to optimize the model. Experimental results show that the proposed method reconstructs high-resolution frames with more accurate and visually pleasant structures compared with state-of-the-art video SR algorithms. We also achieve comparable PSNR/SSIM results with less computation time.
© 2019 SPIE and IS&T 1017-9909/2019/$25.00 © 2019 SPIE and IS&T
Chenyu Liu, Xueming Li, Xianlin Zhang, and Xianlin Zhang "Structure-preserving video super-resolution using three-dimensional convolutional neural networks," Journal of Electronic Imaging 28(2), 021007 (7 February 2019). https://doi.org/10.1117/1.JEI.28.2.021007
Received: 8 September 2018; Accepted: 9 January 2019; Published: 7 February 2019
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Lawrencium

Super resolution

Convolution

Optical flow

Convolutional neural networks

Reconstruction algorithms

Back to Top