Ramani Duraiswami

Music Flamingo: Scaling Music Understanding in Audio Language Models

Nov 13, 2025
Via arXiv

SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models

Nov 13, 2025
Via arXiv

AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning

Aug 10, 2025
Via arXiv

Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge

May 12, 2025
Via arXiv

ProSE: Diffusion Priors for Speech Enhancement

Mar 09, 2025
Via arXiv

Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs

Feb 10, 2025
Via arXiv

3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering

Jan 14, 2025
Via arXiv

TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification

Dec 31, 2024
Via arXiv

Applying Automatic Differentiation to Optimize Differential Microphone Array Designs

Dec 06, 2024
Via arXiv

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Oct 24, 2024
Via arXiv