"speech recognition": models, code, and papers

ASR Error Detection via Audio-Transcript entailment

Jul 22, 2022
Nimshi Venkat Meripo, Sandeep Konam

Biologically inspired speech emotion recognition

Nov 15, 2021
Reza Lotfidereshgi, Philippe Gournay

A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

Mar 18, 2022
He Bai, Renjie Zheng, Junkun Chen, Xintong Li, Mingbo Ma, Liang Huang

Analysis of French Phonetic Idiosyncrasies for Accent Recognition

Oct 18, 2021
Pierre Berjon, Avishek Nag, Soumyabrata Dev

indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages

Mar 31, 2022
Anirudh Gupta, Neeraj Chhimwal, Ankur Dhuriya, Rishabh Gaur, Priyanshi Shah, Harveen Singh Chadha, Vivek Raghavan

Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection

Nov 10, 2017
Taku Kato, Takahiro Shinozaki

Improving End-to-end Speech Recognition with Pronunciation-assisted Sub-word Modeling

Nov 10, 2018
Hainan Xu, Shuoyang Ding, Shinji Watanabe

Sequence-to-sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding

Oct 28, 2019
Alexander H. Liu, Tzu-Wei Sung, Shun-Po Chuang, Hung-yi Lee, Lin-shan Lee

Towards Privacy-Preserving Speech Representation for Client-Side Data Sharing

Mar 26, 2022
Minh Tran, Mohammad Soleymani

Linguistic and Gender Variation in Speech Emotion Recognition using Spectral Features

Dec 17, 2021
Zachary Dair, Ryan Donovan, Ruairi O'Reilly
