Wenning Wei

Fine-Tuning Large Multimodal Models for Automatic Pronunciation Assessment

Sep 19, 2025
Via arXiv

Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment

Mar 14, 2025
Via arXiv

MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023

Sep 12, 2023
Via arXiv

On Addressing Practical Challenges for RNN-Transducer

May 04, 2021
Via arXiv

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability

Jul 30, 2020
Via arXiv