Rohan Kumar Das

EnvSDD: Benchmarking Environmental Sound Deepfake Detection

May 25, 2025

Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages

May 20, 2025

AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation

May 20, 2025

Listen, Analyze, and Adapt to Learn New Attacks: An Exemplar-Free Class Incremental Learning Method for Audio Deepfake Source Tracing

May 20, 2025

AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting

May 17, 2025

Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing

Apr 08, 2025

Multi-modal Speech Enhancement with Limited Electromyography Channels

Jan 11, 2025

XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack Detection

Nov 15, 2024

Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection

Nov 02, 2024

TF-Mamba: A Time-Frequency Network for Sound Source Localization

Sep 08, 2024