Picture for George Saon

George Saon

Self-Speculative Decoding for LLM-based ASR with CTC Encoder Drafts

Add code
Mar 11, 2026
Viaarxiv icon

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

Add code
Mar 09, 2026
Viaarxiv icon

Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities

Add code
May 14, 2025
Viaarxiv icon

A Non-autoregressive Model for Joint STT and TTS

Add code
Jan 15, 2025
Viaarxiv icon

Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition

Add code
Dec 11, 2024
Viaarxiv icon

Exploring the limits of decoder-only models trained on public speech recognition corpora

Add code
Jan 31, 2024
Figure 1 for Exploring the limits of decoder-only models trained on public speech recognition corpora
Figure 2 for Exploring the limits of decoder-only models trained on public speech recognition corpora
Figure 3 for Exploring the limits of decoder-only models trained on public speech recognition corpora
Figure 4 for Exploring the limits of decoder-only models trained on public speech recognition corpora
Viaarxiv icon

Soft Random Sampling: A Theoretical and Empirical Analysis

Add code
Nov 24, 2023
Figure 1 for Soft Random Sampling: A Theoretical and Empirical Analysis
Figure 2 for Soft Random Sampling: A Theoretical and Empirical Analysis
Figure 3 for Soft Random Sampling: A Theoretical and Empirical Analysis
Figure 4 for Soft Random Sampling: A Theoretical and Empirical Analysis
Viaarxiv icon

Semi-Autoregressive Streaming ASR With Label Context

Add code
Sep 19, 2023
Viaarxiv icon

Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

Add code
Sep 07, 2023
Figure 1 for Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems
Figure 2 for Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems
Figure 3 for Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems
Figure 4 for Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems
Viaarxiv icon

Diagonal State Space Augmented Transformers for Speech Recognition

Add code
Feb 27, 2023
Viaarxiv icon