Chelsea Finn

MEM: Multi-Scale Embodied Memory for Vision Language Action Models

Mar 04, 2026
Via arXiv

RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies

Mar 04, 2026
Via arXiv

VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model

Feb 15, 2026
Via arXiv

Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment

Feb 12, 2026
Via arXiv

SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios

Feb 09, 2026
Via arXiv

TQL: Scaling Q-Functions with Transformers by Preventing Attention Collapse

Feb 01, 2026
Via arXiv

Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning

Jan 22, 2026
Via arXiv

RoboReward: General-Purpose Vision-Language Reward Models for Robotics

Jan 08, 2026
Via arXiv

Emergence of Human to Robot Transfer in Vision-Language-Action Models

Dec 27, 2025
Via arXiv

PolaRiS: Scalable Real-to-Sim Evaluations for Generalist Robot Policies

Dec 18, 2025
Via arXiv