Renjie Zheng

ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech

Nov 07, 2022
Xiaoran Fan, Chao Pang, Tian Yuan, He Bai, Renjie Zheng, Pengfei Zhu, Shuohuan Wang, Junkun Chen, Zeyu Chen, Liang Huang, Yu Sun, Hua Wu

Via arXiv

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

May 20, 2022
Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang

Via arXiv

Data-Driven Adaptive Simultaneous Machine Translation

Apr 27, 2022
Guangxu Xun, Mingbo Ma, Yuchen Bian, Xingyu Cai, Jiaji Huang, Renjie Zheng, Junkun Chen, Jiahong Yuan, Kenneth Church, Liang Huang

Via arXiv

A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

Mar 18, 2022
He Bai, Renjie Zheng, Junkun Chen, Xintong Li, Mingbo Ma, Liang Huang

Via arXiv

The Role of Phonetic Units in Speech Emotion Recognition

Aug 02, 2021
Jiahong Yuan, Xingyu Cai, Renjie Zheng, Liang Huang, Kenneth Church

Via arXiv

Decoupling recognition and transcription in Mandarin ASR

Aug 02, 2021
Jiahong Yuan, Xingyu Cai, Dongji Gao, Renjie Zheng, Liang Huang, Kenneth Church

Via arXiv

Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR

Jun 11, 2021
Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang

Via arXiv

Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation

Feb 10, 2021
Renjie Zheng, Junkun Chen, Mingbo Ma, Liang Huang

Via arXiv

MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation

Oct 22, 2020
Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang

Via arXiv

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training

Oct 21, 2020
Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Jiahong Yuan, Kenneth Church, Liang Huang

Via arXiv