Kong Aik Lee

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech

Oct 11, 2022
Xiaohui Liu, Meng Liu, Lin Zhang, Linjuan Zhang, Chang Zeng, Kai Li, Nan Li, Kong Aik Lee, Longbiao Wang, Jianwu Dang

ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild

Oct 05, 2022
Xuechen Liu, Xin Wang, Md Sahidullah, Jose Patino, Héctor Delgado, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas Evans, Andreas Nautsch, Kong Aik Lee

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Aug 17, 2022
Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

Apr 21, 2022
Hye-jin Shim, Hemlata Tak, Xuechen Liu, Hee-Soo Heo, Jee-weon Jung, Joon Son Chung, Soo-Whan Chung, Ha-Jin Yu, Bong-Jin Lee, Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Tomi Kinnunen, Nicholas Evans

Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?

Apr 11, 2022
Qiongqiong Wang, Kong Aik Lee, Tianchi Liu

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

Feb 15, 2022
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li

Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents

Feb 12, 2022
Jing Yang Lee, Kong Aik Lee, Woon Seng Gan

Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

Feb 08, 2022
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu

DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation

Nov 22, 2021
Jing Yang Lee, Kong Aik Lee, Woon Seng Gan
