Large Vocabulary Continuous Speech


LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors

May 16, 2025

Literary and Colloquial Tamil Dialect Identification

Aug 25, 2024

mmWave-Whisper: Phone Call Eavesdropping and Transcription Using Millimeter-Wave Radar

Oct 22, 2024

CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge

Jun 14, 2024

Surrogate Gradient Spiking Neural Networks as Encoders for Large Vocabulary Continuous Speech Recognition

Dec 01, 2022

Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition

Jul 14, 2023

Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition

Feb 21, 2023

An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

Mar 16, 2021

Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy

Feb 03, 2021

Applying GPGPU to Recurrent Neural Network Language Model based Fast Network Search in the Real-Time LVCSR

Jul 23, 2020