"speech recognition": models, code, and papers

VOTE400(Voide Of The Elderly 400 Hours): A Speech Dataset to Study Voice Interface for Elderly-Care

Jan 20, 2021
Minsu Jang, Sangwon Seo, Dohyung Kim, Jaeyeon Lee, Jaehong Kim, Jun-Hwan Ahn

Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition

Oct 11, 2016
Xiangang Li, Xihong Wu

Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging

Aug 07, 2019
Binh Nguyen, Vu Bao Hung Nguyen, Hien Nguyen, Pham Ngoc Phuong, The-Loc Nguyen, Quoc Truong Do, Luong Chi Mai

Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching

Dec 23, 2018
Chih-Kuan Yeh, Jianshu Chen, Chengzhu Yu, Dong Yu

Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition

Nov 12, 2018
Raden Mu'az Mun'im, Nakamasa Inoue, Koichi Shinoda

Speech Emotion Recognition using Semantic Information

Mar 04, 2021
Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn W. Schuller

Recent improvements of ASR models in the face of adversarial attacks

Apr 04, 2022
Raphael Olivier, Bhiksha Raj

An Initialization Scheme for Meeting Separation with Spatial Mixture Models

Apr 04, 2022
Christoph Boeddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

May 06, 2022
Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park, James Walker, Alexander Gruenstein

Set-based Meta-Interpolation for Few-Task Meta-Learning

May 20, 2022
Seanie Lee, Bruno Andreis, Kenji Kawaguchi, Juho Lee, Sung Ju Hwang
