Sanjeel Parekh

LTCI

ArrayDPS-Refine: Generative Refinement of Discriminative Multi-Channel Speech Enhancement

Mar 25, 2026

Unified Diffusion Refinement for Multi-Channel Speech Enhancement and Separation

Mar 25, 2026

Text-to-Stage: Spatial Layouts from Long-form Narratives

Mar 18, 2026

Conditional Flow Matching for Visually-Guided Acoustic Highlighting

Feb 03, 2026

Sound Event Detection with Boundary-Aware Optimization and Inference

Jan 07, 2026

Learning to Highlight Audio by Watching Movies

May 17, 2025

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment

Jan 30, 2025

Tackling Interpretability in Audio Classification Networks with Non-negative Matrix Factorization

May 11, 2023

Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF

Feb 23, 2022

Emotion Transfer Using Vector-Valued Infinite Task Learning

Feb 09, 2021