Alert button
Picture for Shalini Ghosh

Shalini Ghosh

Alert button

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2024
Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh

Figure 1 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 2 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 3 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 4 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Viaarxiv icon

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Add code
Bookmark button
Alert button
Jan 17, 2024
Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko

Viaarxiv icon

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks

Add code
Bookmark button
Alert button
Jan 05, 2024
Kevin Everson, Yile Gu, Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-yi Lee, Ariya Rastrow, Andreas Stolcke

Viaarxiv icon

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jan 04, 2024
David M. Chan, Shalini Ghosh, Hitesh Tulsiani, Ariya Rastrow, Björn Hoffmeister

Viaarxiv icon

Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Add code
Bookmark button
Alert button
Dec 22, 2023
Anirudh S. Sundar, Chao-Han Huck Yang, David M. Chan, Shalini Ghosh, Venkatesh Ravichandran, Phani Sankar Nidadavolu

Figure 1 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 2 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 3 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Figure 4 for Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
Viaarxiv icon

JAB: Joint Adversarial Prompting and Belief Augmentation

Add code
Bookmark button
Alert button
Nov 16, 2023
Ninareh Mehrabi, Palash Goyal, Anil Ramakrishna, Jwala Dhamala, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

Viaarxiv icon

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting

Add code
Bookmark button
Alert button
Oct 10, 2023
Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke

Figure 1 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Figure 2 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Figure 3 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Figure 4 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Viaarxiv icon

Generative Speech Recognition Error Correction with Large Language Models

Add code
Bookmark button
Alert button
Sep 27, 2023
Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke

Figure 1 for Generative Speech Recognition Error Correction with Large Language Models
Figure 2 for Generative Speech Recognition Error Correction with Large Language Models
Figure 3 for Generative Speech Recognition Error Correction with Large Language Models
Figure 4 for Generative Speech Recognition Error Correction with Large Language Models
Viaarxiv icon

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Add code
Bookmark button
Alert button
Sep 26, 2023
Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastow, Ivan Bulyko

Figure 1 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 2 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 3 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 4 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Viaarxiv icon