Alert button
Picture for Kong Aik Lee

Kong Aik Lee

Alert button

Cosine Scoring with Uncertainty for Neural Speaker Embedding

Add code
Bookmark button
Alert button
Mar 11, 2024
Qiongqiong Wang, Kong Aik Lee

Figure 1 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Figure 2 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Figure 3 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Figure 4 for Cosine Scoring with Uncertainty for Neural Speaker Embedding
Viaarxiv icon

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis

Add code
Bookmark button
Alert button
Mar 01, 2024
Weiwei Lin, Chenhang He, Man-Wai Mak, Jiachen Lian, Kong Aik Lee

Figure 1 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 2 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 3 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 4 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Viaarxiv icon

Generalizing Speaker Verification for Spoof Awareness in the Embedding Space

Add code
Bookmark button
Alert button
Jan 28, 2024
Xuechen Liu, Md Sahidullah, Kong Aik Lee, Tomi Kinnunen

Viaarxiv icon

Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio

Add code
Bookmark button
Alert button
Jan 05, 2024
Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li

Viaarxiv icon

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

Add code
Bookmark button
Alert button
Dec 06, 2023
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li

Viaarxiv icon

An Empirical Bayes Framework for Open-Domain Dialogue Generation

Add code
Bookmark button
Alert button
Nov 18, 2023
Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

Viaarxiv icon

Partially Randomizing Transformer Weights for Dialogue Response Diversity

Add code
Bookmark button
Alert button
Nov 18, 2023
Jing Yang Lee, Kong Aik Lee, Woon-Seng Gan

Viaarxiv icon

Disentangling Voice and Content with Self-Supervision for Speaker Recognition

Add code
Bookmark button
Alert button
Oct 02, 2023
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li

Figure 1 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Figure 2 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Figure 3 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Figure 4 for Disentangling Voice and Content with Self-Supervision for Speaker Recognition
Viaarxiv icon

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification

Add code
Bookmark button
Alert button
Sep 26, 2023
Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng

Figure 1 for Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
Figure 2 for Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
Figure 3 for Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
Figure 4 for Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
Viaarxiv icon

The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR

Add code
Bookmark button
Alert button
Sep 24, 2023
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu

Figure 1 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Figure 2 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Figure 3 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Figure 4 for The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR
Viaarxiv icon