Picture for Yatharth Saraf

Yatharth Saraf

Pushing the performances of ASR models on English and Spanish accents

Add code
Dec 22, 2022
Figure 1 for Pushing the performances of ASR models on English and Spanish accents
Figure 2 for Pushing the performances of ASR models on English and Spanish accents
Figure 3 for Pushing the performances of ASR models on English and Spanish accents
Figure 4 for Pushing the performances of ASR models on English and Spanish accents
Viaarxiv icon

Improving Data Driven Inverse Text Normalization using Data Augmentation

Add code
Jul 20, 2022
Figure 1 for Improving Data Driven Inverse Text Normalization using Data Augmentation
Figure 2 for Improving Data Driven Inverse Text Normalization using Data Augmentation
Figure 3 for Improving Data Driven Inverse Text Normalization using Data Augmentation
Figure 4 for Improving Data Driven Inverse Text Normalization using Data Augmentation
Viaarxiv icon

Scaling ASR Improves Zero and Few Shot Learning

Add code
Nov 29, 2021
Figure 1 for Scaling ASR Improves Zero and Few Shot Learning
Figure 2 for Scaling ASR Improves Zero and Few Shot Learning
Figure 3 for Scaling ASR Improves Zero and Few Shot Learning
Figure 4 for Scaling ASR Improves Zero and Few Shot Learning
Viaarxiv icon

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

Add code
Nov 19, 2021
Figure 1 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Figure 2 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Figure 3 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Figure 4 for XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Viaarxiv icon

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

Add code
Nov 18, 2021
Figure 1 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Figure 2 for Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Viaarxiv icon

Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks

Add code
Nov 10, 2021
Figure 1 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Figure 2 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Figure 3 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Figure 4 for Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Viaarxiv icon

Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings

Add code
Oct 08, 2021
Figure 1 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Figure 2 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Figure 3 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Figure 4 for Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Viaarxiv icon

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Add code
Aug 04, 2021
Figure 1 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 2 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 3 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 4 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Viaarxiv icon

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models

Add code
Jul 09, 2021
Figure 1 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Figure 2 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Figure 3 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Figure 4 for On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Viaarxiv icon

Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition

Add code
Jun 14, 2021
Figure 1 for Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition
Figure 2 for Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition
Figure 3 for Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition
Figure 4 for Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition
Viaarxiv icon