Picture for Shiqi Han

Shiqi Han

The WER Trap: Shattering the Illusion of Unified Tokens in Speech Language Models

Add code
May 28, 2026
Viaarxiv icon