Picture for Soo-Whan Chung

Soo-Whan Chung

Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation

Add code
Jun 18, 2024
Viaarxiv icon

MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion

Add code
Jun 16, 2023
Figure 1 for MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion
Figure 2 for MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion
Figure 3 for MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion
Figure 4 for MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion
Viaarxiv icon

HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders

Add code
Jun 02, 2023
Figure 1 for HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Figure 2 for HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Figure 3 for HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Figure 4 for HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders
Viaarxiv icon

Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech

Add code
Feb 27, 2023
Figure 1 for Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Figure 2 for Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Figure 3 for Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Figure 4 for Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech
Viaarxiv icon

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Add code
Feb 27, 2023
Figure 1 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Figure 2 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Figure 3 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Figure 4 for MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition
Viaarxiv icon

Diffusion-based Generative Speech Source Separation

Add code
Nov 02, 2022
Figure 1 for Diffusion-based Generative Speech Source Separation
Figure 2 for Diffusion-based Generative Speech Source Separation
Figure 3 for Diffusion-based Generative Speech Source Separation
Figure 4 for Diffusion-based Generative Speech Source Separation
Viaarxiv icon

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting

Add code
Jul 01, 2022
Figure 1 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Figure 2 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Figure 3 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Figure 4 for Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting
Viaarxiv icon

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

Add code
Apr 21, 2022
Figure 1 for Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
Figure 2 for Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
Figure 3 for Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
Figure 4 for Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
Viaarxiv icon

SASV 2022: The First Spoofing-Aware Speaker Verification Challenge

Add code
Mar 28, 2022
Figure 1 for SASV 2022: The First Spoofing-Aware Speaker Verification Challenge
Figure 2 for SASV 2022: The First Spoofing-Aware Speaker Verification Challenge
Figure 3 for SASV 2022: The First Spoofing-Aware Speaker Verification Challenge
Figure 4 for SASV 2022: The First Spoofing-Aware Speaker Verification Challenge
Viaarxiv icon

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement

Add code
Feb 24, 2022
Figure 1 for Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement
Figure 2 for Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement
Figure 3 for Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement
Figure 4 for Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement
Viaarxiv icon