Alert button

Compositional Video Synthesis with Action Graphs

Jun 27, 2020
Amir Bar, Roei Herzig, Xiaolong Wang, Gal Chechik, Trevor Darrell, Amir Globerson

Figure 1 for Compositional Video Synthesis with Action Graphs
Figure 2 for Compositional Video Synthesis with Action Graphs
Figure 3 for Compositional Video Synthesis with Action Graphs
Figure 4 for Compositional Video Synthesis with Action Graphs

Share this with someone who'll enjoy it:

Videos of actions are complex spatio-temporal signals, containing rich compositional structures. Current generative models are limited in their ability to generate examples of object configurations outside the range they were trained on. Towards this end, we introduce a generative model (AG2Vid) based on Action Graphs, a natural and convenient structure that represents the dynamics of actions between objects over time. Our AG2Vid model disentangles appearance and position features, allowing for more accurate generation. AG2Vid is evaluated on the CATER and Something-Something datasets and outperforms other baselines. Finally, we show how Action Graphs can be used for generating novel compositions of unseen actions.

View paper onarxiv iconopen_review icon

Share this with someone who'll enjoy it: