Alert button

"speech recognition": models, code, and papers
Alert button

Multi-task Voice Activated Framework using Self-supervised Learning

Oct 12, 2021
Shehzeen Hussain, Van Nguyen, Shuhua Zhang, Erik Visser

Figure 1 for Multi-task Voice Activated Framework using Self-supervised Learning
Figure 2 for Multi-task Voice Activated Framework using Self-supervised Learning
Figure 3 for Multi-task Voice Activated Framework using Self-supervised Learning
Viaarxiv icon

Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition

Nov 08, 2012
Ridha Ejbali, Mourad Zaied, Chokri Ben Amar

Figure 1 for Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition
Figure 2 for Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition
Figure 3 for Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition
Figure 4 for Multi-input Multi-output Beta Wavelet Network: Modeling of Acoustic Units for Speech Recognition
Viaarxiv icon

DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis

Dec 09, 2020
Anurag Chowdhury, Arun Ross, Prabu David

Figure 1 for DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis
Figure 2 for DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis
Figure 3 for DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis
Figure 4 for DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis
Viaarxiv icon

Sequence Transduction with Graph-based Supervision

Nov 01, 2021
Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux

Figure 1 for Sequence Transduction with Graph-based Supervision
Figure 2 for Sequence Transduction with Graph-based Supervision
Figure 3 for Sequence Transduction with Graph-based Supervision
Viaarxiv icon

Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset

Nov 01, 2021
Soham Tiwari, Kshitiz Lakhotia, Manjunath Mulimani

Figure 1 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Figure 2 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Figure 3 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Figure 4 for Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset
Viaarxiv icon

IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task

Jun 30, 2021
Pavel Denisov, Manuel Mager, Ngoc Thang Vu

Figure 1 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Figure 2 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Figure 3 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Figure 4 for IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Viaarxiv icon

Deep Spoken Keyword Spotting: An Overview

Nov 20, 2021
Iván López-Espejo, Zheng-Hua Tan, John Hansen, Jesper Jensen

Figure 1 for Deep Spoken Keyword Spotting: An Overview
Figure 2 for Deep Spoken Keyword Spotting: An Overview
Figure 3 for Deep Spoken Keyword Spotting: An Overview
Figure 4 for Deep Spoken Keyword Spotting: An Overview
Viaarxiv icon

Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning

Sep 15, 2021
Songjun Cao, Yueteng Kang, Yanzhe Fu, Xiaoshuo Xu, Sining Sun, Yike Zhang, Long Ma

Figure 1 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Figure 2 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Figure 3 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Figure 4 for Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning
Viaarxiv icon

An Online Multilingual Hate speech Recognition System

Nov 24, 2020
Neeraj Vashistha, Arkaitz Zubiaga

Figure 1 for An Online Multilingual Hate speech Recognition System
Figure 2 for An Online Multilingual Hate speech Recognition System
Figure 3 for An Online Multilingual Hate speech Recognition System
Figure 4 for An Online Multilingual Hate speech Recognition System
Viaarxiv icon