"speech recognition": models, code, and papers

Speech Emotion Recognition Using Deep Sparse Auto-Encoder Extreme Learning Machine with a New Weighting Scheme and Spectro-Temporal Features Along with Classical Feature Selection and A New Quantum-Inspired Dimension Reduction Method

Nov 13, 2021
Fatemeh Daneshfar, Seyed Jahanshah Kabudian

Via arXiv

Speech Enhancement Using Multi-Stage Self-Attentive Temporal Convolutional Networks

Feb 24, 2021
Ju Lin, Adriaan J. van Wijngaarden, Kuang-Ching Wang, Melissa C. Smith

Via arXiv

Language ID Prediction from Speech Using Self-Attentive Pooling and 1D-Convolutions

Apr 24, 2021
Roman Bedyakin, Nikolay Mikhaylovskiy

Via arXiv

Revealing and Protecting Labels in Distributed Training

Oct 31, 2021
Trung Dang, Om Thakkar, Swaroop Ramaswamy, Rajiv Mathews, Peter Chin, Françoise Beaufays

Via arXiv

A brief history of AI: how to prevent another winter (a critical review)

Sep 08, 2021
Amirhosein Toosi, Andrea Bottino, Babak Saboury, Eliot Siegel, Arman Rahmim

Via arXiv

Word-Free Spoken Language Understanding for Mandarin-Chinese

Jul 01, 2021
Zhiyuan Guo, Yuexin Li, Guo Chen, Xingyu Chen, Akshat Gupta

Via arXiv

Correlation based Multi-phasal models for improved imagined speech EEG recognition

Nov 04, 2020
Rini A Sharon, Hema A Murthy

Via arXiv

Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling

Feb 07, 2018
Prashanth Gurunath Shivakumar, Haoqi Li, Kevin Knight, Panayiotis Georgiou

Via arXiv

Token-Level Supervised Contrastive Learning for Punctuation Restoration

Jul 19, 2021
Qiushi Huang, Tom Ko, H Lilian Tang, Xubo Liu, Bo Wu

Via arXiv

Noisy-to-Noisy Voice Conversion Framework with Denoising Model

Sep 22, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

Via arXiv