Picture for Ngoc Thang Vu

Ngoc Thang Vu

Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent

Add code
Aug 26, 2024
Figure 1 for Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent
Figure 2 for Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent
Figure 3 for Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent
Figure 4 for Investigating the effect of Mental Models in User Interaction with an Adaptive Dialog Agent
Viaarxiv icon

Explaining Vision-Language Similarities in Dual Encoders with Feature-Pair Attributions

Add code
Aug 26, 2024
Figure 1 for Explaining Vision-Language Similarities in Dual Encoders with Feature-Pair Attributions
Figure 2 for Explaining Vision-Language Similarities in Dual Encoders with Feature-Pair Attributions
Figure 3 for Explaining Vision-Language Similarities in Dual Encoders with Feature-Pair Attributions
Figure 4 for Explaining Vision-Language Similarities in Dual Encoders with Feature-Pair Attributions
Viaarxiv icon

Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses

Add code
Jul 26, 2024
Figure 1 for Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses
Figure 2 for Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses
Figure 3 for Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses
Figure 4 for Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses
Viaarxiv icon

Probing the Feasibility of Multilingual Speaker Anonymization

Add code
Jul 03, 2024
Figure 1 for Probing the Feasibility of Multilingual Speaker Anonymization
Figure 2 for Probing the Feasibility of Multilingual Speaker Anonymization
Figure 3 for Probing the Feasibility of Multilingual Speaker Anonymization
Figure 4 for Probing the Feasibility of Multilingual Speaker Anonymization
Viaarxiv icon

Controlling Emotion in Text-to-Speech with Natural Language Prompts

Add code
Jun 11, 2024
Figure 1 for Controlling Emotion in Text-to-Speech with Natural Language Prompts
Figure 2 for Controlling Emotion in Text-to-Speech with Natural Language Prompts
Figure 3 for Controlling Emotion in Text-to-Speech with Natural Language Prompts
Figure 4 for Controlling Emotion in Text-to-Speech with Natural Language Prompts
Viaarxiv icon

Meta Learning Text-to-Speech Synthesis in over 7000 Languages

Add code
Jun 10, 2024
Figure 1 for Meta Learning Text-to-Speech Synthesis in over 7000 Languages
Figure 2 for Meta Learning Text-to-Speech Synthesis in over 7000 Languages
Figure 3 for Meta Learning Text-to-Speech Synthesis in over 7000 Languages
Figure 4 for Meta Learning Text-to-Speech Synthesis in over 7000 Languages
Viaarxiv icon

Prompting-based Synthetic Data Generation for Few-Shot Question Answering

Add code
May 15, 2024
Viaarxiv icon

Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training

Add code
Apr 16, 2024
Viaarxiv icon

Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering

Add code
Mar 27, 2024
Viaarxiv icon

Towards a Zero-Data, Controllable, Adaptive Dialog System

Add code
Mar 26, 2024
Viaarxiv icon