Kaustubh Kalgaonkar

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Sep 20, 2023
Tiantian Feng, Ju Lin, Yiteng Huang, Weipeng He, Kaustubh Kalgaonkar, Niko Moritz, Li Wan, Xin Lei, Ming Sun, Frank Seide

Egocentric Audio-Visual Noise Suppression

Nov 07, 2022
Roshan Sharma, Weipeng He, Ju Lin, Egor Lakomkin, Yang Liu, Kaustubh Kalgaonkar

SCA: Streaming Cross-attention Alignment for Echo Cancellation

Nov 01, 2022
Yang Liu, Yangyang Shi, Yun Li, Kaustubh Kalgaonkar, Sriram Srinivasan, Xin Lei

RNN-T For Latency Controlled ASR With Improved Beam Search

Nov 05, 2019
Mahaveer Jain, Kjell Schubert, Jay Mahadeokar, Ching-Feng Yeh, Kaustubh Kalgaonkar, Anuroop Sriram, Christian Fuegen, Michael L. Seltzer

Transformer-Transducer: End-to-End Speech Recognition with Self-Attention

Oct 28, 2019
Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer
