Picture for Ruixiong Zhang

Ruixiong Zhang

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Add code
Oct 11, 2023
Figure 1 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Figure 2 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Figure 3 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Figure 4 for DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies
Viaarxiv icon

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Add code
Nov 05, 2022
Figure 1 for VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Figure 2 for VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Figure 3 for VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Figure 4 for VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
Viaarxiv icon

A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture

Add code
Apr 12, 2022
Figure 1 for A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Figure 2 for A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Figure 3 for A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Figure 4 for A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture
Viaarxiv icon

Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data

Add code
Nov 15, 2021
Figure 1 for Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Figure 2 for Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Figure 3 for Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Figure 4 for Improving Prosody for Unseen Texts in Speech Synthesis by Utilizing Linguistic Information and Noisy Data
Viaarxiv icon

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Add code
Oct 27, 2020
Figure 1 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 2 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 3 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 4 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Viaarxiv icon

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

Add code
Jun 23, 2020
Figure 1 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Figure 2 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Figure 3 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Figure 4 for A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Viaarxiv icon

Cross-task pre-training for acoustic scene classification

Add code
Oct 22, 2019
Figure 1 for Cross-task pre-training for acoustic scene classification
Figure 2 for Cross-task pre-training for acoustic scene classification
Figure 3 for Cross-task pre-training for acoustic scene classification
Figure 4 for Cross-task pre-training for acoustic scene classification
Viaarxiv icon