Picture for Ivan Laptev

Ivan Laptev

WILLOW, LIENS

Learning to Answer Visual Questions from Web Videos

Add code
May 11, 2022
Figure 1 for Learning to Answer Visual Questions from Web Videos
Figure 2 for Learning to Answer Visual Questions from Web Videos
Figure 3 for Learning to Answer Visual Questions from Web Videos
Figure 4 for Learning to Answer Visual Questions from Web Videos
Viaarxiv icon

TubeDETR: Spatio-Temporal Video Grounding with Transformers

Add code
Mar 30, 2022
Figure 1 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 2 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 3 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 4 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Viaarxiv icon

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos

Add code
Mar 22, 2022
Figure 1 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Figure 2 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Figure 3 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Figure 4 for Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Viaarxiv icon

Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems

Add code
Mar 11, 2022
Figure 1 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Figure 2 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Figure 3 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Figure 4 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Viaarxiv icon

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation

Add code
Feb 23, 2022
Figure 1 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Figure 2 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Figure 3 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Figure 4 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Viaarxiv icon

Are Large-scale Datasets Necessary for Self-Supervised Pre-training?

Add code
Dec 20, 2021
Figure 1 for Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Figure 2 for Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Figure 3 for Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Figure 4 for Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
Viaarxiv icon

Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos

Add code
Nov 02, 2021
Figure 1 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 2 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 3 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Figure 4 for Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
Viaarxiv icon

History Aware Multimodal Transformer for Vision-and-Language Navigation

Add code
Oct 25, 2021
Figure 1 for History Aware Multimodal Transformer for Vision-and-Language Navigation
Figure 2 for History Aware Multimodal Transformer for Vision-and-Language Navigation
Figure 3 for History Aware Multimodal Transformer for Vision-and-Language Navigation
Figure 4 for History Aware Multimodal Transformer for Vision-and-Language Navigation
Viaarxiv icon

Differentiable Rendering with Perturbed Optimizers

Add code
Oct 18, 2021
Figure 1 for Differentiable Rendering with Perturbed Optimizers
Figure 2 for Differentiable Rendering with Perturbed Optimizers
Figure 3 for Differentiable Rendering with Perturbed Optimizers
Figure 4 for Differentiable Rendering with Perturbed Optimizers
Viaarxiv icon

Reconstructing and grounding narrated instructional videos in 3D

Add code
Sep 10, 2021
Figure 1 for Reconstructing and grounding narrated instructional videos in 3D
Figure 2 for Reconstructing and grounding narrated instructional videos in 3D
Figure 3 for Reconstructing and grounding narrated instructional videos in 3D
Figure 4 for Reconstructing and grounding narrated instructional videos in 3D
Viaarxiv icon