Picture for Min Tang

Min Tang

Efficient Sparse Attention needs Adaptive Token Release

Add code
Jul 02, 2024
Viaarxiv icon

E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

Add code
Jun 26, 2024
Figure 1 for E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
Figure 2 for E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
Figure 3 for E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
Figure 4 for E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
Viaarxiv icon

An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS

Add code
Jun 09, 2024
Viaarxiv icon

Total-Duration-Aware Duration Modeling for Text-to-Speech Systems

Add code
Jun 06, 2024
Figure 1 for Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
Figure 2 for Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
Figure 3 for Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
Figure 4 for Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
Viaarxiv icon

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Add code
Feb 12, 2024
Viaarxiv icon

NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription

Add code
Jan 16, 2024
Figure 1 for NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription
Figure 2 for NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription
Viaarxiv icon

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Add code
Aug 14, 2023
Figure 1 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Figure 2 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Figure 3 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Figure 4 for SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Viaarxiv icon

CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network

Add code
May 30, 2023
Figure 1 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Figure 2 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Figure 3 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Figure 4 for CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network
Viaarxiv icon

ICASSP 2023 Deep Speech Enhancement Challenge

Add code
Mar 21, 2023
Figure 1 for ICASSP 2023 Deep Speech Enhancement Challenge
Viaarxiv icon

Real-Time Audio-Visual End-to-End Speech Enhancement

Add code
Mar 13, 2023
Figure 1 for Real-Time Audio-Visual End-to-End Speech Enhancement
Figure 2 for Real-Time Audio-Visual End-to-End Speech Enhancement
Figure 3 for Real-Time Audio-Visual End-to-End Speech Enhancement
Viaarxiv icon