Hongsheng Li

MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving

May 14, 2026
Via arXiv

Edit-Based Refinement for Parallel Masked Diffusion Language Models

May 10, 2026
Via arXiv

Context Unrolling in Omni Models

Apr 23, 2026
Via arXiv

Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning

Apr 14, 2026
Via arXiv

LMGenDrive: Bridging Multimodal Understanding and Generative World Modeling for End-to-End Driving

Apr 09, 2026
Via arXiv

Weather-Conditioned Branch Routing for Robust LiDAR-Radar 3D Object Detection

Apr 07, 2026
Via arXiv

MMEmb-R1: Reasoning-Enhanced Multimodal Embedding with Pair-Aware Selection and Adaptive Control

Apr 07, 2026
Via arXiv

AURA: Always-On Understanding and Real-Time Assistance via Video Streams

Apr 05, 2026
Via arXiv

ReinDriveGen: Reinforcement Post-Training for Out-of-Distribution Driving Scene Generation

Apr 01, 2026
Via arXiv

ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework

Mar 21, 2026
Via arXiv