Picture for Kong Aik Lee

Kong Aik Lee

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection

Add code
Jun 25, 2024
Figure 1 for Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
Figure 2 for Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
Figure 3 for Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
Figure 4 for Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection
Viaarxiv icon

Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis

Add code
Jun 16, 2024
Figure 1 for Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
Figure 2 for Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
Figure 3 for Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
Figure 4 for Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis
Viaarxiv icon

Cosine Scoring with Uncertainty for Neural Speaker Embedding

Add code
Mar 11, 2024
Figure 1 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Figure 2 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Figure 3 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Figure 4 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Viaarxiv icon

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis

Add code
Mar 01, 2024
Figure 1 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 2 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 3 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 4 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Viaarxiv icon

Generalizing Speaker Verification for Spoof Awareness in the Embedding Space

Add code
Jan 28, 2024
Figure 1 for Generalizing Speaker Verification for Spoof Awareness in the Embedding Space
Figure 2 for Generalizing Speaker Verification for Spoof Awareness in the Embedding Space
Figure 3 for Generalizing Speaker Verification for Spoof Awareness in the Embedding Space
Figure 4 for Generalizing Speaker Verification for Spoof Awareness in the Embedding Space
Viaarxiv icon

Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio

Add code
Jan 05, 2024
Figure 1 for Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio
Figure 2 for Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio
Figure 3 for Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio
Figure 4 for Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio
Viaarxiv icon

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

Add code
Dec 06, 2023
Viaarxiv icon

Partially Randomizing Transformer Weights for Dialogue Response Diversity

Add code
Nov 18, 2023
Viaarxiv icon

An Empirical Bayes Framework for Open-Domain Dialogue Generation

Add code
Nov 18, 2023
Figure 1 for An Empirical Bayes Framework for Open-Domain Dialogue Generation
Figure 2 for An Empirical Bayes Framework for Open-Domain Dialogue Generation
Figure 3 for An Empirical Bayes Framework for Open-Domain Dialogue Generation
Figure 4 for An Empirical Bayes Framework for Open-Domain Dialogue Generation
Viaarxiv icon

Disentangling Voice and Content with Self-Supervision for Speaker Recognition

Add code
Oct 02, 2023
Figure 1 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Figure 2 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Figure 3 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Figure 4 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Viaarxiv icon