"speech recognition": models, code, and papers

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition

Sep 02, 2020
Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He

Continuous Pseudo-Labeling from the Start

Oct 17, 2022
Dan Berrebbi, Ronan Collobert, Samy Bengio, Navdeep Jaitly, Tatiana Likhomanenko

A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction

Mar 31, 2022
Zexu Pan, Meng Ge, Haizhou Li

Multiclass ASMA vs Targeted PGD Attack in Image Segmentation

Aug 03, 2022
Johnson Vo, Jiabao Xie, Sahil Patel

Arabic Speech Recognition System using CMU-Sphinx4

Apr 17, 2007
H. Satori, M. Harti, N. Chenfour

How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition

Apr 17, 2020
George Sterpu, Christian Saam, Naomi Harte

Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation

Apr 12, 2022
Wenjing Zhu, Xiang Li

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications

Oct 29, 2020
Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao

Incorporating End-to-End Speech Recognition Models for Sentiment Analysis

Feb 28, 2019
Egor Lakomkin, Mohammad Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter
