Alert button
Picture for Benlai Tang

Benlai Tang

Alert button

Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling

Add code
Bookmark button
Alert button
Apr 14, 2024
Quanxiu Wang, Hui Huang, Mingjie Wang, Yong Dai, Jinzuomu Zhong, Benlai Tang

Viaarxiv icon

Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP

Add code
Bookmark button
Alert button
Sep 11, 2023
Jinzuomu Zhong, Yang Li, Hui Huang, Jie Liu, Zhiba Su, Jing Guo, Benlai Tang, Fengjie Zhu

Figure 1 for Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP
Figure 2 for Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP
Figure 3 for Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP
Figure 4 for Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP
Viaarxiv icon

TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection

Add code
Bookmark button
Alert button
Jun 27, 2023
Jie Liu, Zhiba Su, Hui Huang, Caiyan Wan, Quanxiu Wang, Jiangli Hong, Benlai Tang, Fengjie Zhu

Figure 1 for TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Figure 2 for TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Figure 3 for TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Viaarxiv icon

CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation

Add code
Bookmark button
Alert button
May 23, 2023
Jingning Xu, Benlai Tang, Mingjie Wang, Minghao Li, Meirong Ma

Figure 1 for CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation
Figure 2 for CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation
Figure 3 for CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation
Figure 4 for CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation
Viaarxiv icon

Towards Realistic Visual Dubbing with Heterogeneous Sources

Add code
Bookmark button
Alert button
Jan 17, 2022
Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma

Figure 1 for Towards Realistic Visual Dubbing with Heterogeneous Sources
Figure 2 for Towards Realistic Visual Dubbing with Heterogeneous Sources
Figure 3 for Towards Realistic Visual Dubbing with Heterogeneous Sources
Figure 4 for Towards Realistic Visual Dubbing with Heterogeneous Sources
Viaarxiv icon

Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation

Add code
Bookmark button
Alert button
Oct 25, 2021
Jingning Xu, Benlai Tang, Mingjie Wang, Siyuan Bian, Wenyi Guo, Xiang Yin, Zejun Ma

Figure 1 for Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation
Figure 2 for Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation
Figure 3 for Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation
Figure 4 for Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation
Viaarxiv icon

Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding

Add code
Bookmark button
Alert button
Oct 10, 2021
Chao Wang, Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Yibiao Yu, Zejun Ma

Figure 1 for Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding
Figure 2 for Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding
Figure 3 for Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding
Figure 4 for Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding
Viaarxiv icon

PPG-based singing voice conversion with adversarial representation learning

Add code
Bookmark button
Alert button
Oct 28, 2020
Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Ling Xu, Chen Shen, Zejun Ma

Figure 1 for PPG-based singing voice conversion with adversarial representation learning
Figure 2 for PPG-based singing voice conversion with adversarial representation learning
Figure 3 for PPG-based singing voice conversion with adversarial representation learning
Figure 4 for PPG-based singing voice conversion with adversarial representation learning
Viaarxiv icon

Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech

Add code
Bookmark button
Alert button
May 19, 2020
Wenjie Li, Benlai Tang, Xiang Yin, Yushi Zhao, Wei Li, Kang Wang, Hao Huang, Yuxuan Wang, Zejun Ma

Figure 1 for Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech
Figure 2 for Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech
Figure 3 for Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech
Figure 4 for Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech
Viaarxiv icon