Picture for Heng Qu

Heng Qu

HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

Add code
Jun 12, 2026
Viaarxiv icon

Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation

Add code
May 26, 2026
Viaarxiv icon

How Mobile World Model Guides GUI Agents?

Add code
May 11, 2026
Viaarxiv icon

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

Add code
Feb 13, 2026
Viaarxiv icon

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon