Picture for Venkatesh Ravichandran

Venkatesh Ravichandran

The Interspeech 2025 Speech Accessibility Project Challenge

Add code
Jul 29, 2025
Viaarxiv icon

Mitigating Bad Ground Truth in Supervised Machine Learning based Crop Classification: A Multi-Level Framework with Sentinel-2 Images

Add code
Mar 14, 2025
Viaarxiv icon

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Add code
Mar 28, 2024
Viaarxiv icon

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Add code
Jan 26, 2024
Figure 1 for Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion
Figure 2 for Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion
Figure 3 for Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion
Figure 4 for Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion
Viaarxiv icon

Two-pass Endpoint Detection for Speech Recognition

Add code
Jan 17, 2024
Viaarxiv icon

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Add code
Dec 22, 2023
Figure 1 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 2 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 3 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 4 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Viaarxiv icon

Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

Add code
Nov 16, 2023
Figure 1 for Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
Figure 2 for Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
Figure 3 for Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
Figure 4 for Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
Viaarxiv icon

Cross-utterance ASR Rescoring with Graph-based Label Propagation

Add code
Mar 27, 2023
Viaarxiv icon

Adaptive Endpointing with Deep Contextual Multi-armed Bandits

Add code
Mar 23, 2023
Viaarxiv icon

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Add code
Nov 04, 2022
Figure 1 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 2 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 3 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 4 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Viaarxiv icon