Zhenpeng Zhan

Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation

Jun 09, 2025
Via arXiv

DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation

May 19, 2025
Via arXiv

SerialGen: Personalized Image Generation by First Standardization Then Personalization

Dec 02, 2024
Via arXiv