Picture for Xipeng Qiu

Xipeng Qiu

Two Bridges, One Pathway: From VLMs to Generalizable VLAs with Embodied Trajectory-Coupled Data

Add code
Jun 07, 2026
Viaarxiv icon

Coarse-to-Control: Action-Token Planning for Vision-Language-Action Models

Add code
Jun 05, 2026
Viaarxiv icon

Let It Be Simple: One-Step Action Generation for Vision-Language-Action Models

Add code
Jun 04, 2026
Viaarxiv icon

MOSS-Audio Technical Report

Add code
Jun 01, 2026
Viaarxiv icon

AdaptR1: Reinforcement Learning Based Adaptive Interleaved Thinking in Multi-hop Question Answering

Add code
May 29, 2026
Viaarxiv icon

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation

Add code
May 28, 2026
Viaarxiv icon

World Action Models: The Next Frontier in Embodied AI

Add code
May 12, 2026
Viaarxiv icon

X-Voice: Enabling Everyone to Speak 30 Languages via Zero-Shot Cross-Lingual Voice Cloning

Add code
May 07, 2026
Viaarxiv icon

Beyond Rating: A Comprehensive Evaluation and Benchmark for AI Reviews

Add code
Apr 22, 2026
Viaarxiv icon

X-VC: Zero-shot Streaming Voice Conversion in Codec Space

Add code
Apr 14, 2026
Viaarxiv icon