Ruijie Tao

Prompt-driven Target Speech Diarization

Oct 23, 2023
Yidi Jiang, Zhengyang Chen, Ruijie Tao, Liqun Deng, Yanmin Qian, Haizhou Li

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification

Sep 26, 2023
Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng

USED: Universal Speaker Extraction and Diarization

Sep 19, 2023
Junyi Ao, Mehmet Sinan Yıldırım, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li

Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech

Sep 15, 2023
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li

Target Active Speaker Detection with Audio-visual Cues

May 26, 2023
Yidi Jiang, Ruijie Tao, Zexu Pan, Haizhou Li

I4U System Description for NIST SRE'20 CTS Challenge

Nov 02, 2022
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Delgado, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera

Speaker recognition with two-step multi-modal deep cleansing

Oct 28, 2022
Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li

Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs

Oct 27, 2022
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE

Nov 12, 2021
Rohan Kumar Das, Ruijie Tao, Haizhou Li
