Brian Siyuan Zheng

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

Jun 23, 2025
Via arXiv