Alert button
Picture for Sameer Khurana

Sameer Khurana

Alert button

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization

Feb 27, 2024
Yoshiki Masuyama, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

NeuroHeed+: Improving Neuro-steered Speaker Extraction with Joint Auditory Attention Detection

Dec 12, 2023
Zexu Pan, Gordon Wichern, Francois G. Germain, Sameer Khurana, Jonathan Le Roux

Viaarxiv icon

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Oct 30, 2023
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, Francois G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

Generation or Replication: Auscultating Audio Latent Diffusion Models

Oct 16, 2023
Dimitrios Bralios, Gordon Wichern, François G. Germain, Zexu Pan, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

Direct Text to Speech Translation System using Acoustic Units

Sep 14, 2023
Victoria Mingote, Pablo Gimeno, Luis Vicente, Sameer Khurana, Antoine Laurent, Jarod Duret

Figure 1 for Direct Text to Speech Translation System using Acoustic Units
Figure 2 for Direct Text to Speech Translation System using Acoustic Units
Figure 3 for Direct Text to Speech Translation System using Acoustic Units
Figure 4 for Direct Text to Speech Translation System using Acoustic Units
Viaarxiv icon

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Jul 06, 2023
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James Glass

Figure 1 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 2 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 3 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 4 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Viaarxiv icon

Improved Cross-Lingual Transfer Learning For Automatic Speech Translation

Jun 01, 2023
Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James Glass

Figure 1 for Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
Figure 2 for Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
Figure 3 for Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
Figure 4 for Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
Viaarxiv icon

Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages

May 21, 2023
Andrew Rouditchenko, Sameer Khurana, Samuel Thomas, Rogerio Feris, Leonid Karlinsky, Hilde Kuehne, David Harwath, Brian Kingsbury, James Glass

Figure 1 for Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
Figure 2 for Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
Figure 3 for Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
Viaarxiv icon

On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration

Nov 14, 2022
Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James Glass

Figure 1 for On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration
Figure 2 for On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration
Figure 3 for On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration
Figure 4 for On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration
Viaarxiv icon

SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation

May 17, 2022
Sameer Khurana, Antoine Laurent, James Glass

Figure 1 for SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Figure 2 for SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Figure 3 for SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Figure 4 for SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation
Viaarxiv icon