Qianqian Dong

Speech Translation with Large Language Models: An Industrial Practice

Dec 21, 2023
Zhichao Huang, Rong Ye, Tom Ko, Qianqian Dong, Shanbo Cheng, Mingxuan Wang, Hang Li

Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

Sep 21, 2023
Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang

Recent Advances in Direct Speech-to-text Translation

Jun 20, 2023
Chen Xu, Rong Ye, Qianqian Dong, Chengqi Zhao, Tom Ko, Mingxuan Wang, Tong Xiao, Jingbo Zhu

MOSPC: MOS Prediction Based on Pairwise Comparison

Jun 18, 2023
Kexin Wang, Yunlong Zhao, Qianqian Dong, Tom Ko, Mingxuan Wang

PolyVoice: Language Models for Speech to Speech Translation

Jun 13, 2023
Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang

CTC-based Non-autoregressive Speech Translation

May 27, 2023
Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma, Jingbo Zhu

M3ST: Mix at Three Levels for Speech Translation

Dec 07, 2022
Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou

Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation

May 18, 2022
Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang

UniST: Unified End-to-end Model for Streaming and Non-streaming Speech Translation

Sep 15, 2021
Qianqian Dong, Yaoming Zhu, Mingxuan Wang, Lei Li

The Volctrans Neural Speech Translation System for IWSLT 2021

May 16, 2021
Chengqi Zhao, Zhicheng Liu, Jian Tong, Tao Wang, Mingxuan Wang, Rong Ye, Qianqian Dong, Jun Cao, Lei Li
