Anurag Kumar

AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis

Feb 07, 2023
Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Via arXiv

Rethinking complex-valued deep neural networks for monaural speech enhancement

Jan 11, 2023
Haibin Wu, Ke Tan, Buye Xu, Anurag Kumar, Daniel Wong

Via arXiv

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Nov 20, 2022
Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic

Via arXiv

Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement

Nov 16, 2022
Kuan-Lin Chen, Daniel D. E. Wong, Ke Tan, Buye Xu, Anurag Kumar, Vamsi Krishna Ithapu

Via arXiv

Improving Speech Enhancement through Fine-Grained Speech Characteristics

Jul 11, 2022
Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Via arXiv

SAQAM: Spatial Audio Quality Assessment Metric

Jun 24, 2022
Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia

Via arXiv

Speech Quality Assessment through MOS using Non-Matching References

Jun 24, 2022
Pranay Manocha, Anurag Kumar

Via arXiv

RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing

Feb 22, 2022
Efthymios Tzinis, Yossi Adi, Vamsi Krishna Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar

Via arXiv