Anurag Kumar

Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting

Sep 22, 2024

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

Aug 09, 2024

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Jul 04, 2024

AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling

Jun 17, 2024

URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement

Jun 07, 2024

Cross-Talk Reduction

May 30, 2024

Few Shot Class Incremental Learning using Vision-Language models

May 02, 2024

Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark

Mar 27, 2024

A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

Mar 03, 2024

Ambisonics Networks -- The Effect Of Radial Functions Regularization

Feb 29, 2024