Yueteng Kang

MPE-TTS: Customized Emotion Zero-Shot Text-To-Speech Using Multi-Modal Prompt

May 24, 2025
Via arXiv

FreeCodec: A disentangled neural speech codec with fewer tokens

Dec 02, 2024
Via arXiv

DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model

Mar 16, 2023
Via arXiv

Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning

Sep 15, 2021
Via arXiv