Picture for Tan Lee

Tan Lee

PodEval: A Multimodal Evaluation Framework for Podcast Audio Generation

Add code
Oct 01, 2025
Viaarxiv icon

Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong

Add code
May 23, 2025
Viaarxiv icon

Probing Speaker-specific Features in Speaker Representations

Add code
Jan 09, 2025
Figure 1 for Probing Speaker-specific Features in Speaker Representations
Figure 2 for Probing Speaker-specific Features in Speaker Representations
Figure 3 for Probing Speaker-specific Features in Speaker Representations
Figure 4 for Probing Speaker-specific Features in Speaker Representations
Viaarxiv icon

An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems

Add code
Nov 18, 2024
Figure 1 for An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems
Figure 2 for An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems
Figure 3 for An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems
Figure 4 for An Investigation of Reprogramming for Cross-Language Adaptation in Speaker Verification Systems
Viaarxiv icon

CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research

Add code
Sep 04, 2024
Figure 1 for CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research
Figure 2 for CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research
Figure 3 for CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research
Viaarxiv icon

User-Driven Voice Generation and Editing through Latent Space Navigation

Add code
Aug 30, 2024
Figure 1 for User-Driven Voice Generation and Editing through Latent Space Navigation
Figure 2 for User-Driven Voice Generation and Editing through Latent Space Navigation
Figure 3 for User-Driven Voice Generation and Editing through Latent Space Navigation
Figure 4 for User-Driven Voice Generation and Editing through Latent Space Navigation
Viaarxiv icon

ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis

Add code
Jun 13, 2024
Figure 1 for ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis
Figure 2 for ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis
Figure 3 for ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis
Figure 4 for ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis
Viaarxiv icon

A Parameter-efficient Language Extension Framework for Multilingual ASR

Add code
Jun 10, 2024
Figure 1 for A Parameter-efficient Language Extension Framework for Multilingual ASR
Figure 2 for A Parameter-efficient Language Extension Framework for Multilingual ASR
Viaarxiv icon

Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss

Add code
Jan 08, 2024
Viaarxiv icon

LUPET: Incorporating Hierarchical Information Path into Multilingual ASR

Add code
Jan 08, 2024
Figure 1 for LUPET: Incorporating Hierarchical Information Path into Multilingual ASR
Figure 2 for LUPET: Incorporating Hierarchical Information Path into Multilingual ASR
Figure 3 for LUPET: Incorporating Hierarchical Information Path into Multilingual ASR
Figure 4 for LUPET: Incorporating Hierarchical Information Path into Multilingual ASR
Viaarxiv icon