Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Analyzing and Improving the Pyramidal Predictive Network for Future Video Frame Prediction

Jan 13, 2023

Chaofan Ling, Weihua Li, Junpei Zhong

Figure 1 for Analyzing and Improving the Pyramidal Predictive Network for Future Video Frame Prediction

Figure 2 for Analyzing and Improving the Pyramidal Predictive Network for Future Video Frame Prediction

Figure 3 for Analyzing and Improving the Pyramidal Predictive Network for Future Video Frame Prediction

Figure 4 for Analyzing and Improving the Pyramidal Predictive Network for Future Video Frame Prediction

Share this with someone who'll enjoy it:

Abstract:The pyramidal predictive network (PPNV1) proposes an interesting temporal pyramid architecture and yields promising results on the task of future video-frame prediction. We expose and analyze its signal dissemination and characteristic artifacts, and propose corresponding improvements in model architecture and training strategies to address them. Although the PPNV1 theoretically mimics the workings of human brain, its careless signal processing leads to aliasing in the network. We redesign the network architecture to solve the problems. In addition to improving the unreasonable information dissemination, the new architecture also aims to solve the aliasing in neural networks. Different inputs are no longer simply concatenated, and the downsampling and upsampling components have also been redesigned to ensure that the network can more easily construct images from Fourier features of low-frequency inputs. Finally, we further improve the training strategies, to alleviate the problem of input inconsistency during training and testing. Overall, the improved model is more interpretable, stronger, and the quality of its predictions is better. Code is available at https://github.com/Ling-CF/PPNV2.

View paper on

Share this with someone who'll enjoy it:

Title:Analyzing and Improving the Pyramidal Predictive Network for Future Video Frame Prediction

Paper and Code