Picture for Vladimir Bataev

Vladimir Bataev

NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding

Add code
May 28, 2025
Viaarxiv icon

WIND: Accelerated RNN-T Decoding with Windowed Inference for Non-blank Detection

Add code
May 19, 2025
Viaarxiv icon

RNN-Transducer-based Losses for Speech Recognition on Noisy Targets

Add code
Apr 09, 2025
Viaarxiv icon

TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer

Add code
Jan 10, 2025
Viaarxiv icon

Three-in-One: Fast and Accurate Transducer for Hybrid-Autoregressive ASR

Add code
Oct 03, 2024
Viaarxiv icon

Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter

Add code
Jun 11, 2024
Figure 1 for Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter
Figure 2 for Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter
Figure 3 for Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter
Figure 4 for Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter
Viaarxiv icon

Label-Looping: Highly Efficient Decoding for Transducers

Add code
Jun 10, 2024
Figure 1 for Label-Looping: Highly Efficient Decoding for Transducers
Figure 2 for Label-Looping: Highly Efficient Decoding for Transducers
Figure 3 for Label-Looping: Highly Efficient Decoding for Transducers
Figure 4 for Label-Looping: Highly Efficient Decoding for Transducers
Viaarxiv icon

Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU

Add code
Jun 06, 2024
Figure 1 for Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU
Figure 2 for Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU
Figure 3 for Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU
Figure 4 for Speed of Light Exact Greedy Decoding for RNN-T Speech Recognition Models on GPU
Viaarxiv icon

Powerful and Extensible WFST Framework for RNN-Transducer Losses

Add code
Mar 18, 2023
Viaarxiv icon

Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator

Add code
Feb 27, 2023
Figure 1 for Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
Figure 2 for Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
Figure 3 for Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
Figure 4 for Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
Viaarxiv icon