Picture for Jinkui Shi

Jinkui Shi

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

Add code
Mar 27, 2026
Viaarxiv icon

ReMem-VLA: Empowering Vision-Language-Action Model with Memory via Dual-Level Recurrent Queries

Add code
Mar 13, 2026
Viaarxiv icon