"speech recognition": models, code, and papers
Unsupervised Domain Adaptation for Robust Speech Recognition via Variational Autoencoder-Based Data Augmentation

Sep 22, 2017
Wei-Ning Hsu, Yu Zhang, James Glass

Via arXiv

ViDeBERTa: A powerful pre-trained language model for Vietnamese

Jan 25, 2023
Cong Dao Tran, Nhut Huy Pham, Anh Nguyen, Truong Son Hy, Tu Vu

Via arXiv

NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement

May 20, 2022
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu

Via arXiv

Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation

Aug 26, 2022
Zoey Liu, Justin Spence, Emily Prud'hommeaux

Via arXiv

Analysis of Self-Supervised Learning and Dimensionality Reduction Methods in Clustering-Based Active Learning for Speech Emotion Recognition

Jun 21, 2022
Einari Vaaras, Manu Airaksinen, Okko Räsänen

Via arXiv

End-to-end Anchored Speech Recognition

Feb 06, 2019
Yiming Wang, Xing Fan, I-Fan Chen, Yuzong Liu, Tongfei Chen, Björn Hoffmeister

Via arXiv

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers

Nov 08, 2020
Shucong Zhang, Erfan Loweimi, Peter Bell, Steve Renals

Via arXiv

A Purely End-to-end System for Multi-speaker Speech Recognition

May 15, 2018
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey

Via arXiv

User-Level Differential Privacy against Attribute Inference Attack of Speech Emotion Recognition in Federated Learning

Apr 05, 2022
Tiantian Feng, Raghuveer Peri, Shrikanth Narayanan

Via arXiv

Sentiment recognition of Italian elderly through domain adaptation on cross-corpus speech dataset

Nov 14, 2022
Francesca Gasparini, Alessandra Grossi

Via arXiv