Alert button
Picture for Jingbei Li

Jingbei Li

Alert button

DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin

Add code
Bookmark button
Alert button
Sep 02, 2023
Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie

Figure 1 for DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin
Figure 2 for DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin
Figure 3 for DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin
Figure 4 for DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech -- A Study between English and Mandarin
Viaarxiv icon

Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing

Add code
Bookmark button
Alert button
May 09, 2023
Jingbei Li, Sipan Li, Ping Chen, Luwen Zhang, Yi Meng, Zhiyong Wu, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang

Figure 1 for Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
Figure 2 for Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
Figure 3 for Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
Figure 4 for Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
Viaarxiv icon

NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism

Add code
Bookmark button
Alert button
Mar 31, 2022
Jingbei Li, Yi Meng, Zhiyong Wu, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang

Figure 1 for NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism
Figure 2 for NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism
Figure 3 for NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism
Figure 4 for NeuFA: Neural Network Based End-to-End Forced Alignment with Bidirectional Attention Mechanism
Viaarxiv icon

Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
Jun 11, 2021
Jingbei Li, Yi Meng, Chenyi Li, Zhiyong Wu, Helen Meng, Chao Weng, Dan Su

Figure 1 for Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis
Figure 2 for Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis
Figure 3 for Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis
Figure 4 for Spoken Style Learning with Multi-modal Hierarchical Context Encoding for Conversational Text-to-Speech Synthesis
Viaarxiv icon

Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech

Add code
Bookmark button
Alert button
Apr 20, 2021
Yixuan Zhou, Changhe Song, Jingbei Li, Zhiyong Wu, Helen Meng

Figure 1 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Figure 2 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Figure 3 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Figure 4 for Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Viaarxiv icon

Towards Multi-Scale Style Control for Expressive Speech Synthesis

Add code
Bookmark button
Alert button
Apr 08, 2021
Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen Meng

Figure 1 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Figure 2 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Figure 3 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Figure 4 for Towards Multi-Scale Style Control for Expressive Speech Synthesis
Viaarxiv icon

Adversarially learning disentangled speech representations for robust multi-factor voice conversion

Add code
Bookmark button
Alert button
Jan 30, 2021
Jie Wang, Jingbei Li, Xintao Zhao, Zhiyong Wu, Helen Meng

Figure 1 for Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Figure 2 for Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Figure 3 for Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Figure 4 for Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Viaarxiv icon

Syntactic representation learning for neural network based TTS with syntactic parse tree traversal

Add code
Bookmark button
Alert button
Dec 13, 2020
Changhe Song, Jingbei Li, Yixuan Zhou, Zhiyong Wu, Helen Meng

Figure 1 for Syntactic representation learning for neural network based TTS with syntactic parse tree traversal
Figure 2 for Syntactic representation learning for neural network based TTS with syntactic parse tree traversal
Figure 3 for Syntactic representation learning for neural network based TTS with syntactic parse tree traversal
Figure 4 for Syntactic representation learning for neural network based TTS with syntactic parse tree traversal
Viaarxiv icon