Picture for Prashanth Gurunath Shivakumar

Prashanth Gurunath Shivakumar

Multi-Modal Retrieval For Large Language Model Based Speech Recognition

Add code
Jun 13, 2024
Figure 1 for Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Figure 2 for Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Figure 3 for Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Figure 4 for Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Viaarxiv icon

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Add code
Jan 17, 2024
Viaarxiv icon

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks

Add code
Jan 05, 2024
Figure 1 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Figure 2 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Figure 3 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Figure 4 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Viaarxiv icon

Discriminative Speech Recognition Rescoring with Pre-trained Language Models

Add code
Oct 10, 2023
Figure 1 for Discriminative Speech Recognition Rescoring with Pre-trained Language Models
Figure 2 for Discriminative Speech Recognition Rescoring with Pre-trained Language Models
Viaarxiv icon

Personalization for BERT-based Discriminative Speech Recognition Rescoring

Add code
Jul 13, 2023
Figure 1 for Personalization for BERT-based Discriminative Speech Recognition Rescoring
Figure 2 for Personalization for BERT-based Discriminative Speech Recognition Rescoring
Figure 3 for Personalization for BERT-based Discriminative Speech Recognition Rescoring
Figure 4 for Personalization for BERT-based Discriminative Speech Recognition Rescoring
Viaarxiv icon

Scaling Laws for Discriminative Speech Recognition Rescoring Models

Add code
Jun 27, 2023
Figure 1 for Scaling Laws for Discriminative Speech Recognition Rescoring Models
Figure 2 for Scaling Laws for Discriminative Speech Recognition Rescoring Models
Figure 3 for Scaling Laws for Discriminative Speech Recognition Rescoring Models
Figure 4 for Scaling Laws for Discriminative Speech Recognition Rescoring Models
Viaarxiv icon

Distillation Strategies for Discriminative Speech Recognition Rescoring

Add code
Jun 15, 2023
Figure 1 for Distillation Strategies for Discriminative Speech Recognition Rescoring
Figure 2 for Distillation Strategies for Discriminative Speech Recognition Rescoring
Figure 3 for Distillation Strategies for Discriminative Speech Recognition Rescoring
Figure 4 for Distillation Strategies for Discriminative Speech Recognition Rescoring
Viaarxiv icon

Phone Duration Modeling for Speaker Age Estimation in Children

Add code
Sep 03, 2021
Figure 1 for Phone Duration Modeling for Speaker Age Estimation in Children
Figure 2 for Phone Duration Modeling for Speaker Age Estimation in Children
Figure 3 for Phone Duration Modeling for Speaker Age Estimation in Children
Figure 4 for Phone Duration Modeling for Speaker Age Estimation in Children
Viaarxiv icon

Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords

Add code
Feb 19, 2021
Figure 1 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 2 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 3 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Figure 4 for Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with Subwords
Viaarxiv icon

End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study

Add code
Feb 19, 2021
Figure 1 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Figure 2 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Figure 3 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Figure 4 for End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Viaarxiv icon