Alert button
Picture for Venkatesh Ravichandran

Venkatesh Ravichandran

Alert button

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2024
Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh

Figure 1 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 2 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 3 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 4 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Viaarxiv icon

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Add code
Bookmark button
Alert button
Jan 26, 2024
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

Viaarxiv icon

Two-pass Endpoint Detection for Speech Recognition

Add code
Bookmark button
Alert button
Jan 17, 2024
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

Viaarxiv icon

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Add code
Bookmark button
Alert button
Dec 22, 2023
Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu

Figure 1 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 2 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 3 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 4 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Viaarxiv icon

Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

Add code
Bookmark button
Alert button
Nov 16, 2023
Helin Wang, Venkatesh Ravichandran, Milind Rao, Becky Lammers, Myra Sydnor, Nicholas Maragakis, Ankur A. Butala, Jayne Zhang, Lora Clawson, Victoria Chovaz, Laureano Moro-Velazquez

Viaarxiv icon

Cross-utterance ASR Rescoring with Graph-based Label Propagation

Add code
Bookmark button
Alert button
Mar 27, 2023
Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran

Figure 1 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Figure 2 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Figure 3 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Figure 4 for Cross-utterance ASR Rescoring with Graph-based Label Propagation
Viaarxiv icon

Adaptive Endpointing with Deep Contextual Multi-armed Bandits

Add code
Bookmark button
Alert button
Mar 23, 2023
Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh

Figure 1 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Figure 2 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Figure 3 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Figure 4 for Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Viaarxiv icon

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Add code
Bookmark button
Alert button
Nov 04, 2022
Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran

Figure 1 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 2 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 3 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 4 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Viaarxiv icon

Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification

Add code
Bookmark button
Alert button
Jul 08, 2022
Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke

Figure 1 for Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification
Figure 2 for Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification
Figure 3 for Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification
Viaarxiv icon

Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass

Add code
Bookmark button
Alert button
Feb 08, 2022
Olabanji Shonibare, Xiaosu Tong, Venkatesh Ravichandran

Viaarxiv icon