Munhak Lee

BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition

Feb 02, 2026
Via arXiv