Mingbo Ma

VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing

Apr 11, 2024
Philip Anastassiou, Zhenyu Tang, Kainan Peng, Dongya Jia, Jiaxin Li, Ming Tu, Yuping Wang, Yuxuan Wang, Mingbo Ma

Efficient Neural Music Generation

May 25, 2023
Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang

Non-parallel Accent Conversion using Pseudo Siamese Disentanglement Network

Dec 12, 2022
Dongya Jia, Qiao Tian, Jiaxin Li, Yuanzhe Chen, Kainan Peng, Mingbo Ma, Yuping Wang, Yuxuan Wang

Data-Driven Adaptive Simultaneous Machine Translation

Apr 27, 2022
Guangxu Xun, Mingbo Ma, Yuchen Bian, Xingyu Cai, Jiaji Huang, Renjie Zheng, Junkun Chen, Jiahong Yuan, Kenneth Church, Liang Huang

A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

Mar 18, 2022
He Bai, Renjie Zheng, Junkun Chen, Xintong Li, Mingbo Ma, Liang Huang

Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR

Jun 11, 2021
Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang

Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation

Feb 10, 2021
Renjie Zheng, Junkun Chen, Mingbo Ma, Liang Huang

MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation

Oct 22, 2020
Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training

Oct 21, 2020
Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Jiahong Yuan, Kenneth Church, Liang Huang
