Alert button
Picture for Pranav Dheram

Pranav Dheram

Alert button

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2024
Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh

Figure 1 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 2 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 3 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 4 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Viaarxiv icon

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Add code
Bookmark button
Alert button
Jan 26, 2024
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

Viaarxiv icon

Mining Duplicate Questions of Stack Overflow

Add code
Bookmark button
Alert button
Oct 04, 2022
Mihir Kale, Anirudha Rayasam, Radhika Parik, Pranav Dheram

Figure 1 for Mining Duplicate Questions of Stack Overflow
Figure 2 for Mining Duplicate Questions of Stack Overflow
Figure 3 for Mining Duplicate Questions of Stack Overflow
Viaarxiv icon

Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities

Add code
Bookmark button
Alert button
Jul 22, 2022
Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke

Figure 1 for Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities
Figure 2 for Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities
Figure 3 for Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities
Figure 4 for Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities
Viaarxiv icon

End-to-End Spoken Language Understanding using RNN-Transducer ASR

Add code
Bookmark button
Alert button
Jul 08, 2021
Anirudh Raju, Gautam Tiwari, Milind Rao, Pranav Dheram, Bryan Anderson, Zhe Zhang, Bach Bui, Ariya Rastrow

Figure 1 for End-to-End Spoken Language Understanding using RNN-Transducer ASR
Figure 2 for End-to-End Spoken Language Understanding using RNN-Transducer ASR
Figure 3 for End-to-End Spoken Language Understanding using RNN-Transducer ASR
Figure 4 for End-to-End Spoken Language Understanding using RNN-Transducer ASR
Viaarxiv icon

Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding

Add code
Bookmark button
Alert button
Feb 12, 2021
Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke

Figure 1 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Figure 2 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Figure 3 for Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding
Viaarxiv icon

Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces

Add code
Bookmark button
Alert button
Aug 14, 2020
Milind Rao, Anirudh Raju, Pranav Dheram, Bach Bui, Ariya Rastrow

Figure 1 for Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Figure 2 for Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Figure 3 for Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Figure 4 for Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces
Viaarxiv icon