Alert button
Picture for Rohan Kumar Das

Rohan Kumar Das

Alert button

Dual Knowledge Distillation for Efficient Sound Event Detection

Feb 05, 2024
Yang Xiao, Rohan Kumar Das

Viaarxiv icon

Adaptive-avg-pooling based Attention Vision Transformer for Face Anti-spoofing

Jan 10, 2024
Jichen Yang, Fangfan Chen, Rohan Kumar Das, Zhengyu Zhu, Shunsi Zhang

Viaarxiv icon

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

May 18, 2023
Tanmay Khandelwal, Rohan Kumar Das

Figure 1 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Figure 2 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Figure 3 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Figure 4 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Viaarxiv icon

Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions

Apr 25, 2023
Tanmay Khandelwal, Rohan Kumar Das, Andrew Koh, Eng Siong Chng

Figure 1 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Figure 2 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Figure 3 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Figure 4 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Viaarxiv icon

I4U System Description for NIST SRE'20 CTS Challenge

Nov 02, 2022
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera

Figure 1 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 2 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 3 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 4 for I4U System Description for NIST SRE'20 CTS Challenge
Viaarxiv icon

Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs

Oct 27, 2022
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

Figure 1 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 2 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 3 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 4 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Viaarxiv icon

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

Feb 15, 2022
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li

Figure 1 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Figure 2 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Figure 3 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Viaarxiv icon

HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE

Nov 12, 2021
Rohan Kumar Das, Ruijie Tao, Haizhou Li

Figure 1 for HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE
Figure 2 for HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRE
Viaarxiv icon

Self-supervised Speaker Recognition with Loss-gated Learning

Oct 08, 2021
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

Figure 1 for Self-supervised Speaker Recognition with Loss-gated Learning
Figure 2 for Self-supervised Speaker Recognition with Loss-gated Learning
Figure 3 for Self-supervised Speaker Recognition with Loss-gated Learning
Figure 4 for Self-supervised Speaker Recognition with Loss-gated Learning
Viaarxiv icon