Youkyum Kim

Bridging the Gap between Audio and Text using Parallel-attention for User-defined Keyword Spotting

Aug 07, 2024
Via arXiv

VoxSim: A perceptual voice similarity dataset

Jul 26, 2024
Via arXiv

Metric Learning for User-defined Keyword Spotting

Nov 01, 2022
Via arXiv

Disentangled representation learning for multilingual speaker recognition

Nov 01, 2022
Via arXiv