Picture for Ngoc Thang Vu

Ngoc Thang Vu

Oh, Jeez! or Uh-huh? A Listener-aware Backchannel Predictor on ASR Transcriptions

Add code
Apr 10, 2023
Viaarxiv icon

Conversational Tree Search: A New Hybrid Dialog Task

Add code
Mar 17, 2023
Viaarxiv icon

ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English

Add code
Nov 22, 2022
Figure 1 for ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English
Figure 2 for ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English
Figure 3 for ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English
Figure 4 for ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic - English
Viaarxiv icon

Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

Add code
Oct 20, 2022
Figure 1 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 2 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 3 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Figure 4 for Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
Viaarxiv icon

How (Not) To Evaluate Explanation Quality

Add code
Oct 13, 2022
Figure 1 for How (Not) To Evaluate Explanation Quality
Figure 2 for How (Not) To Evaluate Explanation Quality
Figure 3 for How (Not) To Evaluate Explanation Quality
Figure 4 for How (Not) To Evaluate Explanation Quality
Viaarxiv icon

Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text

Add code
Oct 11, 2022
Figure 1 for Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Figure 2 for Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Figure 3 for Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Figure 4 for Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
Viaarxiv icon

The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles

Add code
Jul 31, 2022
Figure 1 for The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles
Figure 2 for The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles
Figure 3 for The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles
Figure 4 for The Who in Code-Switching: A Case Study for Predicting Egyptian Arabic-English Code-Switching Levels based on Character Profiles
Viaarxiv icon

PoeticTTS -- Controllable Poetry Reading for Literary Studies

Add code
Jul 11, 2022
Figure 1 for PoeticTTS -- Controllable Poetry Reading for Literary Studies
Figure 2 for PoeticTTS -- Controllable Poetry Reading for Literary Studies
Figure 3 for PoeticTTS -- Controllable Poetry Reading for Literary Studies
Figure 4 for PoeticTTS -- Controllable Poetry Reading for Literary Studies
Viaarxiv icon

Speaker Anonymization with Phonetic Intermediate Representations

Add code
Jul 11, 2022
Figure 1 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 2 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 3 for Speaker Anonymization with Phonetic Intermediate Representations
Figure 4 for Speaker Anonymization with Phonetic Intermediate Representations
Viaarxiv icon

Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech

Add code
Jun 24, 2022
Figure 1 for Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
Figure 2 for Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
Figure 3 for Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
Figure 4 for Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
Viaarxiv icon