Gautam Bhattacharya

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models

Jun 16, 2023
Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian McAuley

Full-band General Audio Synthesis with Score-based Diffusion

Oct 26, 2022
Santiago Pascual, Gautam Bhattacharya, Chunghsin Yeh, Jordi Pons, Joan Serrà

Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification

Nov 07, 2018
Gautam Bhattacharya, Joao Monteiro, Jahangir Alam, Patrick Kenny

Adapting End-to-End Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training

Nov 07, 2018
Gautam Bhattacharya, Jahangir Alam, Patrick Kenny
