Picture for Hung-Yi Lee

Hung-Yi Lee

CAAD: Contrastive Audio-Aware Distillation for Efficient Speech Language Models

Add code
Jun 22, 2026
Viaarxiv icon

Steering Where to Listen: Instruction-Based Activation Steering Redirects Temporal Attention in Large Audio-Language Models

Add code
Jun 09, 2026
Viaarxiv icon

CodaRAG: Connecting the Dots with Associativity Inspired by Complementary Learning

Add code
Apr 12, 2026
Viaarxiv icon

Latent-Mark: An Audio Watermark Robust to Neural Resynthesis

Add code
Mar 05, 2026
Viaarxiv icon

On The Landscape of Spoken Language Models: A Comprehensive Survey

Add code
Apr 11, 2025
Figure 1 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Figure 2 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Figure 3 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Figure 4 for On The Landscape of Spoken Language Models: A Comprehensive Survey
Viaarxiv icon

Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

Add code
Jun 16, 2024
Figure 1 for Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies
Figure 2 for Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies
Figure 3 for Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies
Figure 4 for Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies
Viaarxiv icon

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Add code
Feb 22, 2024
Viaarxiv icon

Examining Forgetting in Continual Pre-training of Aligned Large Language Models

Add code
Jan 06, 2024
Viaarxiv icon

Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs

Add code
Jan 30, 2023
Figure 1 for Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Figure 2 for Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Figure 3 for Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Figure 4 for Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs
Viaarxiv icon

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Add code
Dec 20, 2022
Viaarxiv icon