Alert button

"speech recognition": models, code, and papers
Alert button

A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding

Add code
Bookmark button
Alert button
Nov 04, 2021
Yingzhi Wang, Abdelmoumene Boumadane, Abdelwahab Heba

Figure 1 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Figure 2 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Figure 3 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Figure 4 for A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding
Viaarxiv icon

Language model fusion for streaming end to end speech recognition

Apr 09, 2021
Rodrigo Cabrera, Xiaofeng Liu, Mohammadreza Ghodsi, Zebulun Matteson, Eugene Weinstein, Anjuli Kannan

Figure 1 for Language model fusion for streaming end to end speech recognition
Figure 2 for Language model fusion for streaming end to end speech recognition
Figure 3 for Language model fusion for streaming end to end speech recognition
Figure 4 for Language model fusion for streaming end to end speech recognition
Viaarxiv icon

On Scaling Contrastive Representations for Low-Resource Speech Recognition

Add code
Bookmark button
Alert button
Feb 01, 2021
Lasse Borgholt, Tycho Max Sylvester Tax, Jakob Drachmann Havtorn, Lars Maaløe, Christian Igel

Figure 1 for On Scaling Contrastive Representations for Low-Resource Speech Recognition
Figure 2 for On Scaling Contrastive Representations for Low-Resource Speech Recognition
Figure 3 for On Scaling Contrastive Representations for Low-Resource Speech Recognition
Figure 4 for On Scaling Contrastive Representations for Low-Resource Speech Recognition
Viaarxiv icon

Two-Pass End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Aug 29, 2019
Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu

Figure 1 for Two-Pass End-to-End Speech Recognition
Figure 2 for Two-Pass End-to-End Speech Recognition
Figure 3 for Two-Pass End-to-End Speech Recognition
Figure 4 for Two-Pass End-to-End Speech Recognition
Viaarxiv icon

Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition

Oct 10, 2021
Guoli Ye, Vadim Mazalov, Jinyu Li, Yifan Gong

Figure 1 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 2 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 3 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Figure 4 for Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition
Viaarxiv icon

An analysis of degenerating speech due to progressive dysarthria on ASR performance

Oct 31, 2022
Katrin Tomanek, Katie Seaver, Pan-Pan Jiang, Richard Cave, Lauren Harrel, Jordan R. Green

Figure 1 for An analysis of degenerating speech due to progressive dysarthria on ASR performance
Figure 2 for An analysis of degenerating speech due to progressive dysarthria on ASR performance
Figure 3 for An analysis of degenerating speech due to progressive dysarthria on ASR performance
Figure 4 for An analysis of degenerating speech due to progressive dysarthria on ASR performance
Viaarxiv icon

Rationalizing Predictions by Adversarial Information Calibration

Add code
Bookmark button
Alert button
Jan 15, 2023
Lei Sha, Oana-Maria Camburu, Thomas Lukasiewicz

Figure 1 for Rationalizing Predictions by Adversarial Information Calibration
Figure 2 for Rationalizing Predictions by Adversarial Information Calibration
Figure 3 for Rationalizing Predictions by Adversarial Information Calibration
Figure 4 for Rationalizing Predictions by Adversarial Information Calibration
Viaarxiv icon

A bandit approach to curriculum generation for automatic speech recognition

Add code
Bookmark button
Alert button
Feb 06, 2021
Anastasia Kuznetsova, Anurag Kumar, Francis M. Tyers

Figure 1 for A bandit approach to curriculum generation for automatic speech recognition
Figure 2 for A bandit approach to curriculum generation for automatic speech recognition
Figure 3 for A bandit approach to curriculum generation for automatic speech recognition
Figure 4 for A bandit approach to curriculum generation for automatic speech recognition
Viaarxiv icon

A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding

Nov 10, 2022
Yifan Peng, Siddhant Arora, Yosuke Higuchi, Yushi Ueda, Sujay Kumar, Karthik Ganesan, Siddharth Dalmia, Xuankai Chang, Shinji Watanabe

Figure 1 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Figure 2 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Figure 3 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Figure 4 for A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Viaarxiv icon

A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

Nov 04, 2022
Jian Xue, Peidong Wang, Jinyu Li, Eric Sun

Figure 1 for A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Figure 2 for A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Figure 3 for A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Figure 4 for A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability
Viaarxiv icon