Gen Li

LA4VLA: Learning to Act without Seeing via Language-Action Pretraining

Jun 25, 2026
Via arXiv

Diffusion Models Adapt to Low-Dimensional Structure Under Flexible Coefficient Choices

Jun 22, 2026
Via arXiv

ADAPT: Analytical Disturbance-Aware Policy Training for Humanoid Locomotion

Jun 15, 2026
Via arXiv

Distilling Drifting Transformers with Representation Autoencoders

Jun 14, 2026
Via arXiv

GIVE: Grounding Human Gestures in Vision-Language-Action Models

Jun 11, 2026
Via arXiv

PACT: Learning Diverse Diagnostic Strategies via Privileged Synthesis and Branch Consensus

Jun 08, 2026
Via arXiv

MIRAGE: Mobile Agents with Implicit Reasoning and Generative World Models

Jun 03, 2026
Via arXiv

MARS Policy: Multimodality Only When It Matters

May 28, 2026
Via arXiv

OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning

May 28, 2026
Via arXiv

Gaze2Act: Gaze-Conditioned Vision-Language-Action Policies for Interactive Robot Manipulation

May 28, 2026
Via arXiv