Ivan Laptev

WILLOW, LIENS

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Dec 02, 2024

MALT: Improving Reasoning with Multi-Agent LLM Training

Dec 02, 2024

MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation

Nov 26, 2024

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Nov 25, 2024

Mitigating Object Hallucination via Concentric Causal Attention

Oct 21, 2024

Learning feasible transitions for efficient contact planning

Jul 16, 2024

Short Film Dataset: A Benchmark for Story-Level Video Understanding

Jun 14, 2024

MirrorCheck: Efficient Adversarial Defense for Vision-Language Models

Jun 13, 2024

ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos

Apr 24, 2024

SUGAR: Pre-training 3D Visual Representations for Robotics

Apr 01, 2024