Alert button

"speech": models, code, and papers
Alert button

Self-move and Other-move: Quantum Categorical Foundations of Japanese

Oct 10, 2022
Ryder Dale Walton

Figure 1 for Self-move and Other-move: Quantum Categorical Foundations of Japanese
Figure 2 for Self-move and Other-move: Quantum Categorical Foundations of Japanese
Figure 3 for Self-move and Other-move: Quantum Categorical Foundations of Japanese
Figure 4 for Self-move and Other-move: Quantum Categorical Foundations of Japanese
Viaarxiv icon

Estimating the confidence of speech spoofing countermeasure

Add code
Bookmark button
Alert button
Oct 10, 2021
Xin Wang, Junichi Yamagishi

Figure 1 for Estimating the confidence of speech spoofing countermeasure
Figure 2 for Estimating the confidence of speech spoofing countermeasure
Figure 3 for Estimating the confidence of speech spoofing countermeasure
Figure 4 for Estimating the confidence of speech spoofing countermeasure
Viaarxiv icon

Model architectures to extrapolate emotional expressions in DNN-based text-to-speech

Feb 20, 2021
Katsuki Inoue, Sunao Hara, Masanobu Abe, Nobukatsu Hojo, Yusuke Ijima

Figure 1 for Model architectures to extrapolate emotional expressions in DNN-based text-to-speech
Figure 2 for Model architectures to extrapolate emotional expressions in DNN-based text-to-speech
Figure 3 for Model architectures to extrapolate emotional expressions in DNN-based text-to-speech
Figure 4 for Model architectures to extrapolate emotional expressions in DNN-based text-to-speech
Viaarxiv icon

A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data

Jun 01, 2021
Nathan Howard, Alex Park, Turaj Zakizadeh Shabestary, Alexander Gruenstein, Rohit Prabhavalkar

Figure 1 for A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data
Figure 2 for A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data
Figure 3 for A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data
Figure 4 for A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data
Viaarxiv icon

On the Relevance of Bandwidth Extension for Speaker Verification

Apr 05, 2022
Marcos Faundez-Zanuy, Mattias Nilsson, W. Bastiaan Kleijn

Figure 1 for On the Relevance of Bandwidth Extension for Speaker Verification
Figure 2 for On the Relevance of Bandwidth Extension for Speaker Verification
Figure 3 for On the Relevance of Bandwidth Extension for Speaker Verification
Figure 4 for On the Relevance of Bandwidth Extension for Speaker Verification
Viaarxiv icon

A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework

Aug 21, 2022
Avinash Madasu, Anvesh Rao Vijjini

Figure 1 for A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Figure 2 for A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Figure 3 for A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Figure 4 for A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Viaarxiv icon

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Add code
Bookmark button
Alert button
Sep 08, 2022
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

Figure 1 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 2 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 3 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 4 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Viaarxiv icon

Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

Jun 16, 2021
Jisi Zhang, Catalin Zorila, Rama Doddipatla, Jon Barker

Figure 1 for Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Figure 2 for Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Figure 3 for Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Figure 4 for Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation
Viaarxiv icon

Streaming end-to-end speech recognition with jointly trained neural feature enhancement

May 04, 2021
Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han

Figure 1 for Streaming end-to-end speech recognition with jointly trained neural feature enhancement
Figure 2 for Streaming end-to-end speech recognition with jointly trained neural feature enhancement
Viaarxiv icon

Universal Fourier Attack for Time Series

Sep 02, 2022
Elizabeth Coda, Brad Clymer, Chance DeSmet, Yijing Watkins, Michael Girard

Figure 1 for Universal Fourier Attack for Time Series
Figure 2 for Universal Fourier Attack for Time Series
Figure 3 for Universal Fourier Attack for Time Series
Figure 4 for Universal Fourier Attack for Time Series
Viaarxiv icon