Picture for Shixi Huang

Shixi Huang

Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation

Add code
Jun 09, 2025
Viaarxiv icon

DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation

Add code
May 19, 2025
Viaarxiv icon