Picture for Ivan Laptev

Ivan Laptev

WILLOW, LIENS

ScanEdit: Hierarchically-Guided Functional 3D Scan Editing

Add code
Apr 21, 2025
Viaarxiv icon

DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding

Add code
Mar 13, 2025
Viaarxiv icon

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Add code
Jan 10, 2025
Figure 1 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 2 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 3 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Figure 4 for LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Viaarxiv icon

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Add code
Dec 11, 2024
Viaarxiv icon

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Add code
Dec 02, 2024
Viaarxiv icon

MALT: Improving Reasoning with Multi-Agent LLM Training

Add code
Dec 02, 2024
Figure 1 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 2 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 3 for MALT: Improving Reasoning with Multi-Agent LLM Training
Viaarxiv icon

MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation

Add code
Nov 26, 2024
Figure 1 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Figure 2 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Figure 3 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Figure 4 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon

Mitigating Object Hallucination via Concentric Causal Attention

Add code
Oct 21, 2024
Viaarxiv icon

Learning feasible transitions for efficient contact planning

Add code
Jul 16, 2024
Viaarxiv icon