Picture for Rohit Girdhar

Rohit Girdhar

Jack

Forward Prediction for Physical Reasoning

Add code
Jun 18, 2020
Figure 1 for Forward Prediction for Physical Reasoning
Figure 2 for Forward Prediction for Physical Reasoning
Figure 3 for Forward Prediction for Physical Reasoning
Figure 4 for Forward Prediction for Physical Reasoning
Viaarxiv icon

Video Understanding as Machine Translation

Add code
Jun 12, 2020
Figure 1 for Video Understanding as Machine Translation
Figure 2 for Video Understanding as Machine Translation
Figure 3 for Video Understanding as Machine Translation
Figure 4 for Video Understanding as Machine Translation
Viaarxiv icon

Are we asking the right questions in MovieQA?

Add code
Nov 08, 2019
Figure 1 for Are we asking the right questions in MovieQA?
Figure 2 for Are we asking the right questions in MovieQA?
Figure 3 for Are we asking the right questions in MovieQA?
Figure 4 for Are we asking the right questions in MovieQA?
Viaarxiv icon

CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning

Add code
Oct 10, 2019
Figure 1 for CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Figure 2 for CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Figure 3 for CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Figure 4 for CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Viaarxiv icon

MetaPix: Few-Shot Video Retargeting

Add code
Oct 10, 2019
Figure 1 for MetaPix: Few-Shot Video Retargeting
Figure 2 for MetaPix: Few-Shot Video Retargeting
Figure 3 for MetaPix: Few-Shot Video Retargeting
Figure 4 for MetaPix: Few-Shot Video Retargeting
Viaarxiv icon

DistInit: Learning Video Representations without a Single Labeled Video

Add code
Jan 26, 2019
Figure 1 for DistInit: Learning Video Representations without a Single Labeled Video
Figure 2 for DistInit: Learning Video Representations without a Single Labeled Video
Figure 3 for DistInit: Learning Video Representations without a Single Labeled Video
Figure 4 for DistInit: Learning Video Representations without a Single Labeled Video
Viaarxiv icon

Video Action Transformer Network

Add code
Dec 06, 2018
Figure 1 for Video Action Transformer Network
Figure 2 for Video Action Transformer Network
Figure 3 for Video Action Transformer Network
Figure 4 for Video Action Transformer Network
Viaarxiv icon

A Better Baseline for AVA

Add code
Jul 26, 2018
Figure 1 for A Better Baseline for AVA
Figure 2 for A Better Baseline for AVA
Figure 3 for A Better Baseline for AVA
Figure 4 for A Better Baseline for AVA
Viaarxiv icon

Detect-and-Track: Efficient Pose Estimation in Videos

Add code
May 02, 2018
Figure 1 for Detect-and-Track: Efficient Pose Estimation in Videos
Figure 2 for Detect-and-Track: Efficient Pose Estimation in Videos
Figure 3 for Detect-and-Track: Efficient Pose Estimation in Videos
Figure 4 for Detect-and-Track: Efficient Pose Estimation in Videos
Viaarxiv icon

Binge Watching: Scaling Affordance Learning from Sitcoms

Add code
Apr 09, 2018
Figure 1 for Binge Watching: Scaling Affordance Learning from Sitcoms
Figure 2 for Binge Watching: Scaling Affordance Learning from Sitcoms
Figure 3 for Binge Watching: Scaling Affordance Learning from Sitcoms
Figure 4 for Binge Watching: Scaling Affordance Learning from Sitcoms
Viaarxiv icon