Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Taylor saves for later: disentanglement for video prediction using Taylor representation

May 24, 2021

Ting Pan, Zhuqing Jiang, Jianan Han, Shiping Wen, Aidong Men, Haiying Wang

Figure 1 for Taylor saves for later: disentanglement for video prediction using Taylor representation

Figure 2 for Taylor saves for later: disentanglement for video prediction using Taylor representation

Figure 3 for Taylor saves for later: disentanglement for video prediction using Taylor representation

Figure 4 for Taylor saves for later: disentanglement for video prediction using Taylor representation

Share this with someone who'll enjoy it:

Abstract:Video prediction is a challenging task with wide application prospects in meteorology and robot systems. Existing works fail to trade off short-term and long-term prediction performances and extract robust latent dynamics laws in video frames. We propose a two-branch seq-to-seq deep model to disentangle the Taylor feature and the residual feature in video frames by a novel recurrent prediction module (TaylorCell) and residual module. TaylorCell can expand the video frames' high-dimensional features into the finite Taylor series to describe the latent laws. In TaylorCell, we propose the Taylor prediction unit (TPU) and the memory correction unit (MCU). TPU employs the first input frame's derivative information to predict the future frames, avoiding error accumulation. MCU distills all past frames' information to correct the predicted Taylor feature from TPU. Correspondingly, the residual module extracts the residual feature complementary to the Taylor feature. On three generalist datasets (Moving MNIST, TaxiBJ, Human 3.6), our model outperforms or reaches state-of-the-art models, and ablation experiments demonstrate the effectiveness of our model in long-term prediction.

View paper on

Share this with someone who'll enjoy it:

Title:Taylor saves for later: disentanglement for video prediction using Taylor representation

Paper and Code