Picture for Hwayeon Kim

Hwayeon Kim

ZeSTA: Zero-Shot TTS Augmentation with Domain-Conditioned Training for Data-Efficient Personalized Speech Synthesis

Add code
Mar 04, 2026
Viaarxiv icon

Exploring Fine-Tuning of Large Audio Language Models for Spoken Language Understanding under Limited Speech data

Add code
Sep 18, 2025
Viaarxiv icon

DESAMO: A Device for Elder-Friendly Smart Homes Powered by Embedded LLM with Audio Modality

Add code
Aug 26, 2025
Viaarxiv icon

CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model

Add code
Nov 19, 2024
Figure 1 for CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model
Figure 2 for CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model
Figure 3 for CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model
Figure 4 for CUE-M: Contextual Understanding and Enhanced Search with Multimodal Large Language Model
Viaarxiv icon