Picture for Chao Weng

Chao Weng

Rep2wav: Noise Robust text-to-speech Using self-supervised representations

Add code
Sep 04, 2023
Figure 1 for Rep2wav: Noise Robust text-to-speech Using self-supervised representations
Figure 2 for Rep2wav: Noise Robust text-to-speech Using self-supervised representations
Figure 3 for Rep2wav: Noise Robust text-to-speech Using self-supervised representations
Figure 4 for Rep2wav: Noise Robust text-to-speech Using self-supervised representations
Viaarxiv icon

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression

Add code
Aug 21, 2023
Figure 1 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Figure 2 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Figure 3 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Figure 4 for Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression
Viaarxiv icon

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction

Add code
Aug 19, 2023
Viaarxiv icon

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

Add code
Jul 13, 2023
Figure 1 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 2 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 3 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 4 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Viaarxiv icon

Make-A-Voice: Unified Voice Synthesis With Discrete Representation

Add code
May 30, 2023
Viaarxiv icon

Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model

Add code
May 26, 2023
Viaarxiv icon

Eeg2vec: Self-Supervised Electroencephalographic Representation Learning

Add code
May 23, 2023
Figure 1 for Eeg2vec: Self-Supervised Electroencephalographic Representation Learning
Figure 2 for Eeg2vec: Self-Supervised Electroencephalographic Representation Learning
Figure 3 for Eeg2vec: Self-Supervised Electroencephalographic Representation Learning
Figure 4 for Eeg2vec: Self-Supervised Electroencephalographic Representation Learning
Viaarxiv icon

HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec

Add code
May 07, 2023
Viaarxiv icon

InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt

Add code
Jan 31, 2023
Figure 1 for InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Figure 2 for InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Figure 3 for InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Figure 4 for InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt
Viaarxiv icon

High Fidelity Speech Enhancement with Band-split RNN

Add code
Dec 01, 2022
Viaarxiv icon