Alert button

"speech recognition": models, code, and papers
Alert button

Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription

Add code
Bookmark button
Alert button
Jul 02, 2021
Nikita Pavlichenko, Ivan Stelmakh, Dmitry Ustalov

Figure 1 for Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription
Figure 2 for Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription
Figure 3 for Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription
Figure 4 for Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription
Viaarxiv icon

Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature

Nov 22, 2021
Yiwen Shao, Shi-Xiong Zhang, Dong Yu

Figure 1 for Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Figure 2 for Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Figure 3 for Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Figure 4 for Multi-Channel Multi-Speaker ASR Using 3D Spatial Feature
Viaarxiv icon

AI Accelerator Survey and Trends

Add code
Bookmark button
Alert button
Sep 18, 2021
Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner

Figure 1 for AI Accelerator Survey and Trends
Figure 2 for AI Accelerator Survey and Trends
Figure 3 for AI Accelerator Survey and Trends
Figure 4 for AI Accelerator Survey and Trends
Viaarxiv icon

Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora

Sep 23, 2021
Szu-Jui Chen, Wei Xia, John H. L. Hansen

Figure 1 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Figure 2 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Figure 3 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Figure 4 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Viaarxiv icon

Differentially Private Speaker Anonymization

Feb 23, 2022
Ali Shahin Shamsabadi, Brij Mohan Lal Srivastava, Aurélien Bellet, Nathalie Vauquier, Emmanuel Vincent, Mohamed Maouche, Marc Tommasi, Nicolas Papernot

Figure 1 for Differentially Private Speaker Anonymization
Figure 2 for Differentially Private Speaker Anonymization
Figure 3 for Differentially Private Speaker Anonymization
Figure 4 for Differentially Private Speaker Anonymization
Viaarxiv icon

Sequence-level self-learning with multiple hypotheses

Dec 10, 2021
Kenichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur, Robert Gmyr, Sefik Emre Eskimez, Jinyu Li, Michael Zeng

Figure 1 for Sequence-level self-learning with multiple hypotheses
Figure 2 for Sequence-level self-learning with multiple hypotheses
Figure 3 for Sequence-level self-learning with multiple hypotheses
Figure 4 for Sequence-level self-learning with multiple hypotheses
Viaarxiv icon

Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Add code
Bookmark button
Alert button
Feb 15, 2021
Bidisha Sharma, Maulik Madhavi, Haizhou Li

Figure 1 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Figure 2 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Figure 3 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Figure 4 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Viaarxiv icon

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition

Apr 19, 2017
Shubham Toshniwal, Hao Tang, Liang Lu, Karen Livescu

Figure 1 for Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
Figure 2 for Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
Figure 3 for Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
Viaarxiv icon

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation

Add code
Bookmark button
Alert button
Aug 09, 2021
Minghan Wang, Yuxia Wang, Chang Su, Jiaxin Guo, Yingtao Zhang, Yujia Liu, Min Zhang, Shimin Tao, Xingshan Zeng, Liangyou Li, Hao Yang, Ying Qin

Figure 1 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Figure 2 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Figure 3 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Figure 4 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Viaarxiv icon

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Aug 04, 2021
Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli

Figure 1 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 2 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 3 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 4 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Viaarxiv icon