Picture for Andreas Stolcke

Andreas Stolcke

SRI International, Menlo Park, CA 94025

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Add code
Jan 26, 2024
Viaarxiv icon

Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

Add code
Jan 23, 2024
Viaarxiv icon

Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition

Add code
Jan 19, 2024
Viaarxiv icon

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Add code
Jan 17, 2024
Viaarxiv icon

Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks

Add code
Jan 05, 2024
Figure 1 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Figure 2 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Figure 3 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Figure 4 for Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
Viaarxiv icon

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting

Add code
Oct 10, 2023
Figure 1 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Figure 2 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Figure 3 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Figure 4 for Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
Viaarxiv icon

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Add code
Sep 26, 2023
Figure 1 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 2 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 3 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 4 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Viaarxiv icon

Learning When to Trust Which Teacher for Weakly Supervised ASR

Add code
Jun 21, 2023
Figure 1 for Learning When to Trust Which Teacher for Weakly Supervised ASR
Figure 2 for Learning When to Trust Which Teacher for Weakly Supervised ASR
Figure 3 for Learning When to Trust Which Teacher for Weakly Supervised ASR
Figure 4 for Learning When to Trust Which Teacher for Weakly Supervised ASR
Viaarxiv icon

Streaming Speech-to-Confusion Network Speech Recognition

Add code
Jun 02, 2023
Figure 1 for Streaming Speech-to-Confusion Network Speech Recognition
Figure 2 for Streaming Speech-to-Confusion Network Speech Recognition
Figure 3 for Streaming Speech-to-Confusion Network Speech Recognition
Figure 4 for Streaming Speech-to-Confusion Network Speech Recognition
Viaarxiv icon

PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers

Add code
Mar 30, 2023
Figure 1 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Figure 2 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Figure 3 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Figure 4 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Viaarxiv icon