"speech recognition": models, code, and papers

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises

Feb 14, 2023
Chenglei Si, Zhengyan Zhang, Yingfa Chen, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun

Via arXiv

Speaker Normalization for Self-supervised Speech Emotion Recognition

Feb 02, 2022
Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory

Via arXiv

Who Needs Words? Lexicon-Free Speech Recognition

Apr 09, 2019
Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert

Via arXiv

A two-step approach to leverage contextual data: speech recognition in air-traffic communications

Feb 08, 2022
Iuliia Nigmatulina, Juan Zuluaga-Gomez, Amrutha Prasad, Seyyed Saeed Sarfjoo, Petr Motlicek

Via arXiv

Amortized Neural Networks for Low-Latency Speech Recognition

Aug 03, 2021
Jonathan Macoskey, Grant P. Strimel, Jinru Su, Ariya Rastrow

Via arXiv

Simple and Effective Unsupervised Speech Translation

Oct 18, 2022
Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino

Via arXiv

Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data

Feb 13, 2023
Gorka Abad, Oguzhan Ersoy, Stjepan Picek, Aitor Urbieta

Via arXiv

Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

Sep 14, 2021
Felix Wu, Kwangyoun Kim, Jing Pan, Kyu Han, Kilian Q. Weinberger, Yoav Artzi

Via arXiv

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction

Nov 23, 2022
Kai Shen, Yichong Leng, Xu Tan, Siliang Tang, Yuan Zhang, Wenjie Liu, Edward Lin

Via arXiv

Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages

Jan 17, 2021
Cheng Yi, Jianzhong Wang, Ning Cheng, Shiyu Zhou, Bo Xu

Via arXiv