Hung-Yi Lee

Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

Jun 16, 2024
Via arXiv

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Feb 22, 2024
Via arXiv

Examining Forgetting in Continual Pre-training of Aligned Large Language Models

Jan 06, 2024
Via arXiv

Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs

Jan 30, 2023
Via arXiv

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Dec 20, 2022
Via arXiv

The Ability of Self-Supervised Speech Models for Audio Representations

Sep 28, 2022
Via arXiv

On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting

Apr 01, 2022
Via arXiv

Partially Fake Audio Detection by Self-attention-based Fake Span Discovery

Feb 15, 2022
Via arXiv

S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations

Oct 12, 2021
Via arXiv

Analyzing the Robustness of Unsupervised Speech Recognition

Oct 12, 2021
Via arXiv