Picture for Jee-weon Jung

Jee-weon Jung

Beyond Silence: Bias Analysis through Loss and Asymmetric Approach in Audio Anti-Spoofing

Add code
Jun 25, 2024
Viaarxiv icon

On the Evaluation of Speech Foundation Models for Spoken Language Understanding

Add code
Jun 14, 2024
Viaarxiv icon

To what extent can ASV systems naturally defend against spoofing attacks?

Add code
Jun 08, 2024
Figure 1 for To what extent can ASV systems naturally defend against spoofing attacks?
Figure 2 for To what extent can ASV systems naturally defend against spoofing attacks?
Figure 3 for To what extent can ASV systems naturally defend against spoofing attacks?
Figure 4 for To what extent can ASV systems naturally defend against spoofing attacks?
Viaarxiv icon

Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement

Add code
Jun 06, 2024
Viaarxiv icon

a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification

Add code
Mar 03, 2024
Figure 1 for a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification
Figure 2 for a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification
Figure 3 for a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification
Figure 4 for a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification
Viaarxiv icon

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Add code
Feb 25, 2024
Viaarxiv icon

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Add code
Feb 01, 2024
Figure 1 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Figure 2 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Figure 3 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Figure 4 for Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Viaarxiv icon

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Add code
Jan 30, 2024
Figure 1 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Figure 2 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Figure 3 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Figure 4 for OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Viaarxiv icon

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Add code
Jan 30, 2024
Figure 1 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Figure 2 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Figure 3 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Figure 4 for ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Viaarxiv icon

Improving Design of Input Condition Invariant Speech Enhancement

Add code
Jan 25, 2024
Figure 1 for Improving Design of Input Condition Invariant Speech Enhancement
Figure 2 for Improving Design of Input Condition Invariant Speech Enhancement
Figure 3 for Improving Design of Input Condition Invariant Speech Enhancement
Figure 4 for Improving Design of Input Condition Invariant Speech Enhancement
Viaarxiv icon