Alert button
Picture for Dongyang Dai

Dongyang Dai

Alert button

RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction

Add code
Bookmark button
Alert button
Mar 08, 2024
Peng Liu, Dongyang Dai

Figure 1 for RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction
Figure 2 for RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction
Figure 3 for RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction
Figure 4 for RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction
Viaarxiv icon

Cloning one's voice using very limited data in the wild

Add code
Bookmark button
Alert button
Oct 08, 2021
Dongyang Dai, Yuanzhe Chen, Li Chen, Ming Tu, Lu Liu, Rui Xia, Qiao Tian, Yuping Wang, Yuxuan Wang

Figure 1 for Cloning one's voice using very limited data in the wild
Figure 2 for Cloning one's voice using very limited data in the wild
Figure 3 for Cloning one's voice using very limited data in the wild
Figure 4 for Cloning one's voice using very limited data in the wild
Viaarxiv icon

Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network

Add code
Bookmark button
Alert button
Dec 21, 2020
Xiong Cai, Zhiyong Wu, Kuo Zhong, Bin Su, Dongyang Dai, Helen Meng

Figure 1 for Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network
Figure 2 for Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network
Figure 3 for Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network
Figure 4 for Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network
Viaarxiv icon

Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams

Add code
Bookmark button
Alert button
Jun 20, 2020
Huirong Huang, Zhiyong Wu, Shiyin Kang, Dongyang Dai, Jia Jia, Tianxiao Fu, Deyi Tuo, Guangzhi Lei, Peng Liu, Dan Su, Dong Yu, Helen Meng

Figure 1 for Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Figure 2 for Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Figure 3 for Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Figure 4 for Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Viaarxiv icon

Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement

Add code
Bookmark button
Alert button
May 26, 2020
Dongyang Dai, Li Chen, Yuping Wang, Mu Wang, Rui Xia, Xuchen Song, Zhiyong Wu, Yuxuan Wang

Figure 1 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Figure 2 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Figure 3 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Figure 4 for Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
Viaarxiv icon