Picture for Ivan Laptev

Ivan Laptev

WILLOW, LIENS

Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation

Add code
Dec 20, 2022
Figure 1 for Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Figure 2 for Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Figure 3 for Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Figure 4 for Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation
Viaarxiv icon

Image Compression with Product Quantized Masked Image Modeling

Add code
Dec 14, 2022
Figure 1 for Image Compression with Product Quantized Masked Image Modeling
Figure 2 for Image Compression with Product Quantized Masked Image Modeling
Figure 3 for Image Compression with Product Quantized Masked Image Modeling
Figure 4 for Image Compression with Product Quantized Masked Image Modeling
Viaarxiv icon

Multi-Task Learning of Object State Changes from Uncurated Videos

Add code
Nov 24, 2022
Figure 1 for Multi-Task Learning of Object State Changes from Uncurated Videos
Figure 2 for Multi-Task Learning of Object State Changes from Uncurated Videos
Figure 3 for Multi-Task Learning of Object State Changes from Uncurated Videos
Figure 4 for Multi-Task Learning of Object State Changes from Uncurated Videos
Viaarxiv icon

Language Conditioned Spatial Relation Reasoning for 3D Object Grounding

Add code
Nov 17, 2022
Viaarxiv icon

Instruction-driven history-aware policies for robotic manipulations

Add code
Sep 22, 2022
Figure 1 for Instruction-driven history-aware policies for robotic manipulations
Figure 2 for Instruction-driven history-aware policies for robotic manipulations
Figure 3 for Instruction-driven history-aware policies for robotic manipulations
Figure 4 for Instruction-driven history-aware policies for robotic manipulations
Viaarxiv icon

Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control

Add code
Sep 19, 2022
Figure 1 for Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control
Figure 2 for Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control
Figure 3 for Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control
Figure 4 for Enforcing the consensus between Trajectory Optimization and Policy Learning for precise robot control
Viaarxiv icon

Learning from Unlabeled 3D Environments for Vision-and-Language Navigation

Add code
Aug 24, 2022
Figure 1 for Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Figure 2 for Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Figure 3 for Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Figure 4 for Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Viaarxiv icon

AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction

Add code
Jul 26, 2022
Figure 1 for AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction
Figure 2 for AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction
Figure 3 for AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction
Figure 4 for AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction
Viaarxiv icon

Augmenting differentiable physics with randomized smoothing

Add code
Jun 23, 2022
Figure 1 for Augmenting differentiable physics with randomized smoothing
Figure 2 for Augmenting differentiable physics with randomized smoothing
Figure 3 for Augmenting differentiable physics with randomized smoothing
Figure 4 for Augmenting differentiable physics with randomized smoothing
Viaarxiv icon

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Add code
Jun 16, 2022
Figure 1 for Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Figure 2 for Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Figure 3 for Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Figure 4 for Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Viaarxiv icon