Alert button
Picture for George Saon

George Saon

Alert button

Exploring the limits of decoder-only models trained on public speech recognition corpora

Add code
Bookmark button
Alert button
Jan 31, 2024
Ankit Gupta, George Saon, Brian Kingsbury

Viaarxiv icon

Soft Random Sampling: A Theoretical and Empirical Analysis

Add code
Bookmark button
Alert button
Nov 24, 2023
Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Viaarxiv icon

Semi-Autoregressive Streaming ASR With Label Context

Add code
Bookmark button
Alert button
Sep 19, 2023
Siddhant Arora, George Saon, Shinji Watanabe, Brian Kingsbury

Figure 1 for Semi-Autoregressive Streaming ASR With Label Context
Figure 2 for Semi-Autoregressive Streaming ASR With Label Context
Figure 3 for Semi-Autoregressive Streaming ASR With Label Context
Figure 4 for Semi-Autoregressive Streaming ASR With Label Context
Viaarxiv icon

Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

Add code
Bookmark button
Alert button
Sep 07, 2023
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon

Viaarxiv icon

Diagonal State Space Augmented Transformers for Speech Recognition

Add code
Bookmark button
Alert button
Feb 27, 2023
George Saon, Ankit Gupta, Xiaodong Cui

Figure 1 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 2 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 3 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 4 for Diagonal State Space Augmented Transformers for Speech Recognition
Viaarxiv icon

VQ-T: RNN Transducers using Vector-Quantized Prediction Network States

Add code
Bookmark button
Alert button
Aug 03, 2022
Jiatong Shi, George Saon, David Haws, Shinji Watanabe, Brian Kingsbury

Figure 1 for VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
Figure 2 for VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
Figure 3 for VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
Figure 4 for VQ-T: RNN Transducers using Vector-Quantized Prediction Network States
Viaarxiv icon

Extending RNN-T-based speech recognition systems with emotion and language classification

Add code
Bookmark button
Alert button
Jul 28, 2022
Zvi Kons, Hagai Aronowitz, Edmilson Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas, George Saon

Figure 1 for Extending RNN-T-based speech recognition systems with emotion and language classification
Figure 2 for Extending RNN-T-based speech recognition systems with emotion and language classification
Figure 3 for Extending RNN-T-based speech recognition systems with emotion and language classification
Figure 4 for Extending RNN-T-based speech recognition systems with emotion and language classification
Viaarxiv icon

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization

Add code
Bookmark button
Alert button
Jun 16, 2022
Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan

Figure 1 for Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
Figure 2 for Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
Figure 3 for Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
Viaarxiv icon

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

Add code
Bookmark button
Alert button
Apr 01, 2022
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon

Figure 1 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Figure 2 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Figure 3 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Figure 4 for Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Viaarxiv icon