Alert button
Picture for Rohan Kumar Das

Rohan Kumar Das

Alert button

Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

Add code
Bookmark button
Alert button
Apr 16, 2024
Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

Viaarxiv icon

Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training

Add code
Bookmark button
Alert button
Apr 01, 2024
Ruijie Tao, Xinyuan Qian, Rohan Kumar Das, Xiaoxue Gao, Jiadong Wang, Haizhou Li

Viaarxiv icon

Dual Knowledge Distillation for Efficient Sound Event Detection

Add code
Bookmark button
Alert button
Feb 05, 2024
Yang Xiao, Rohan Kumar Das

Viaarxiv icon

Adaptive-avg-pooling based Attention Vision Transformer for Face Anti-spoofing

Add code
Bookmark button
Alert button
Jan 10, 2024
Jichen Yang, Fangfan Chen, Rohan Kumar Das, Zhengyu Zhu, Shunsi Zhang

Viaarxiv icon

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds

Add code
Bookmark button
Alert button
May 18, 2023
Tanmay Khandelwal, Rohan Kumar Das

Figure 1 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Figure 2 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Figure 3 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Figure 4 for A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Viaarxiv icon

Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions

Add code
Bookmark button
Alert button
Apr 25, 2023
Tanmay Khandelwal, Rohan Kumar Das, Andrew Koh, Eng Siong Chng

Figure 1 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Figure 2 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Figure 3 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Figure 4 for Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions
Viaarxiv icon

I4U System Description for NIST SRE'20 CTS Challenge

Add code
Bookmark button
Alert button
Nov 02, 2022
Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang, Luis Buera

Figure 1 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 2 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 3 for I4U System Description for NIST SRE'20 CTS Challenge
Figure 4 for I4U System Description for NIST SRE'20 CTS Challenge
Viaarxiv icon

Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs

Add code
Bookmark button
Alert button
Oct 27, 2022
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

Figure 1 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 2 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 3 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 4 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Viaarxiv icon

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

Add code
Bookmark button
Alert button
Feb 15, 2022
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li

Figure 1 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Figure 2 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Figure 3 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Viaarxiv icon