Alert button
Picture for Anurag Kumar

Anurag Kumar

Alert button

Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark

Add code
Bookmark button
Alert button
Mar 27, 2024
Ziyang Chen, Israel D. Gebru, Christian Richardt, Anurag Kumar, William Laney, Andrew Owens, Alexander Richard

Viaarxiv icon

A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

Add code
Bookmark button
Alert button
Mar 03, 2024
Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar

Figure 1 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 2 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 3 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Figure 4 for A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement
Viaarxiv icon

Ambisonics Networks -- The Effect Of Radial Functions Regularization

Add code
Bookmark button
Alert button
Feb 29, 2024
Bar Shaybet, Anurag Kumar, Vladimir Tourbabin, Boaz Rafaely

Viaarxiv icon

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Bookmark button
Alert button
Oct 27, 2023
Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis

Figure 1 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 2 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 3 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 4 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Viaarxiv icon

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields

Add code
Bookmark button
Alert button
Sep 27, 2023
Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Figure 1 for Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields
Figure 2 for Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields
Figure 3 for Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields
Figure 4 for Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields
Viaarxiv icon

DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models

Add code
Bookmark button
Alert button
Jul 31, 2023
Chao Huang, Susan Liang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Figure 1 for DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Figure 2 for DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Figure 3 for DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Figure 4 for DAVIS: High-Quality Audio-Visual Separation with Generative Diffusion Models
Viaarxiv icon

TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio

Add code
Bookmark button
Alert button
Apr 04, 2023
Anurag Kumar, Ke Tan, Zhaoheng Ni, Pranay Manocha, Xiaohui Zhang, Ethan Henderson, Buye Xu

Figure 1 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 2 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 3 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Figure 4 for TorchAudio-Squim: Reference-less Speech Quality and Intelligibility measures in TorchAudio
Viaarxiv icon

Egocentric Audio-Visual Object Localization

Add code
Bookmark button
Alert button
Mar 23, 2023
Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Figure 1 for Egocentric Audio-Visual Object Localization
Figure 2 for Egocentric Audio-Visual Object Localization
Figure 3 for Egocentric Audio-Visual Object Localization
Figure 4 for Egocentric Audio-Visual Object Localization
Viaarxiv icon

PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement

Add code
Bookmark button
Alert button
Feb 16, 2023
Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Figure 1 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Figure 2 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Figure 3 for PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement
Viaarxiv icon

TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement

Add code
Bookmark button
Alert button
Feb 16, 2023
Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Figure 1 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 2 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 3 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Figure 4 for TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement
Viaarxiv icon