Picture for Ivan Laptev

Ivan Laptev

WILLOW, LIENS

A1: A Fully Transparent Open-Source, Adaptive and Efficient Truncated Vision-Language-Action Model

Add code
Apr 07, 2026
Viaarxiv icon

ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation

Add code
Mar 30, 2026
Viaarxiv icon

MessyKitchens: Contact-rich object-level 3D scene reconstruction

Add code
Mar 17, 2026
Viaarxiv icon

PhysMoDPO: Physically-Plausible Humanoid Motion with Preference Optimization

Add code
Mar 16, 2026
Viaarxiv icon

World2Act: Latent Action Post-Training via Skill-Compositional World Models

Add code
Mar 11, 2026
Viaarxiv icon

Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos

Add code
Mar 10, 2026
Viaarxiv icon

Choose What to Observe: Task-Aware Semantic-Geometric Representations for Visuomotor Policy

Add code
Mar 09, 2026
Viaarxiv icon

GLaD: Geometric Latent Distillation for Vision-Language-Action Models

Add code
Dec 10, 2025
Figure 1 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Figure 2 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Figure 3 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Figure 4 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Viaarxiv icon

BLAZER: Bootstrapping LLM-based Manipulation Agents with Zero-Shot Data Generation

Add code
Oct 09, 2025
Viaarxiv icon

Learning to Generate Object Interactions with Physics-Guided Video Diffusion

Add code
Oct 02, 2025
Viaarxiv icon