Picture for Min Tang

Min Tang

An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS

Add code
Jun 09, 2024
Viaarxiv icon

Total-Duration-Aware Duration Modeling for Text-to-Speech Systems

Add code
Jun 06, 2024
Viaarxiv icon

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Add code
Feb 12, 2024
Viaarxiv icon

NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription

Jan 16, 2024
Viaarxiv icon

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Add code
Aug 14, 2023
Figure 1 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Figure 2 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Figure 3 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Figure 4 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Viaarxiv icon

CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network

May 30, 2023
Figure 1 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Figure 2 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Figure 3 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Figure 4 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Viaarxiv icon

ICASSP 2023 Deep Speech Enhancement Challenge

Add code
Mar 21, 2023
Figure 1 for ICASSP 2023 Deep Speech Enhancement Challenge
Viaarxiv icon

Real-Time Audio-Visual End-to-End Speech Enhancement

Add code
Mar 13, 2023
Figure 1 for Real-Time Audio-Visual End-to-End Speech Enhancement
Figure 2 for Real-Time Audio-Visual End-to-End Speech Enhancement
Figure 3 for Real-Time Audio-Visual End-to-End Speech Enhancement
Viaarxiv icon

Exploring WavLM on Speech Enhancement

Nov 18, 2022
Figure 1 for Exploring WavLM on Speech Enhancement
Figure 2 for Exploring WavLM on Speech Enhancement
Figure 3 for Exploring WavLM on Speech Enhancement
Viaarxiv icon

Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net

Nov 04, 2022
Figure 1 for Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net
Figure 2 for Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net
Figure 3 for Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation with E3Net
Viaarxiv icon