Nancy F. Chen

Semi-supervised Learning For Robust Speech Evaluation

Sep 23, 2024

Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization

Sep 16, 2024

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders

Sep 10, 2024

MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues

Aug 26, 2024

LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs

Aug 16, 2024

PRESENT: Zero-Shot Text-to-Prosody Control

Aug 13, 2024

TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations

Jul 02, 2024

AudioBench: A Universal Benchmark for Audio Large Language Models

Jun 25, 2024

Dataset-Distillation Generative Model for Speech Emotion Recognition

Jun 05, 2024

Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework

May 24, 2024