Jaeyoung Roh

Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting

May 22, 2025
Via arXiv

CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting

Jun 12, 2024
Via arXiv

Relational Proxy Loss for Audio-Text based Keyword Spotting

Jun 08, 2024
Via arXiv