Picture for Nancy F. Chen

Nancy F. Chen

Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs

Add code
Jun 07, 2025
Viaarxiv icon

What Makes a Good Natural Language Prompt?

Add code
Jun 07, 2025
Viaarxiv icon

Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems

Add code
May 21, 2025
Viaarxiv icon

Distilling a speech and music encoder with task arithmetic

Add code
May 19, 2025
Viaarxiv icon

Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation

Add code
May 10, 2025
Viaarxiv icon

Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts

Add code
Apr 15, 2025
Viaarxiv icon

Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models

Add code
Jan 02, 2025
Figure 1 for Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models
Figure 2 for Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models
Figure 3 for Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models
Figure 4 for Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models
Viaarxiv icon

MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond

Add code
Dec 20, 2024
Viaarxiv icon

MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models

Add code
Dec 18, 2024
Figure 1 for MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models
Figure 2 for MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models
Viaarxiv icon

Towards a Speech Foundation Model for Singapore and Beyond

Add code
Dec 16, 2024
Viaarxiv icon