Alert button

"speech": models, code, and papers
Alert button

Speech-driven facial animation using polynomial fusion of features

Dec 12, 2019
Triantafyllos Kefalas, Konstantinos Vougioukas, Yannis Panagakis, Stavros Petridis, Jean Kossaifi, Maja Pantic

Figure 1 for Speech-driven facial animation using polynomial fusion of features
Figure 2 for Speech-driven facial animation using polynomial fusion of features
Viaarxiv icon

Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System

Oct 03, 2019
Kai Fan, Jiayi Wang, Bo Li, Boxing Chen, Niyu Ge

Figure 1 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Figure 2 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Figure 3 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Figure 4 for Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System
Viaarxiv icon

Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment

Oct 24, 2020
Ethan A. Chi, Julian Salazar, Katrin Kirchhoff

Figure 1 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 2 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 3 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 4 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Viaarxiv icon

Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis

Add code
Bookmark button
Alert button
Jan 18, 2022
Hang Jiang, Yining Hua, Doug Beeferman, Deb Roy

Figure 1 for Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis
Figure 2 for Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis
Figure 3 for Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis
Figure 4 for Annotating the Tweebank Corpus on Named Entity Recognition and Building NLP Models for Social Media Analysis
Viaarxiv icon

Towards Learning Universal Audio Representations

Add code
Bookmark button
Alert button
Dec 01, 2021
Luyu Wang, Pauline Luc, Yan Wu, Adria Recasens, Lucas Smaira, Andrew Brock, Andrew Jaegle, Jean-Baptiste Alayrac, Sander Dieleman, Joao Carreira, Aaron van den Oord

Figure 1 for Towards Learning Universal Audio Representations
Figure 2 for Towards Learning Universal Audio Representations
Figure 3 for Towards Learning Universal Audio Representations
Figure 4 for Towards Learning Universal Audio Representations
Viaarxiv icon

Universal adversarial examples in speech command classification

Add code
Bookmark button
Alert button
Nov 26, 2019
Jon Vadillo, Roberto Santana

Figure 1 for Universal adversarial examples in speech command classification
Figure 2 for Universal adversarial examples in speech command classification
Figure 3 for Universal adversarial examples in speech command classification
Figure 4 for Universal adversarial examples in speech command classification
Viaarxiv icon

Punctuation Restoration

Add code
Bookmark button
Alert button
Feb 19, 2022
Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen

Figure 1 for Punctuation Restoration
Figure 2 for Punctuation Restoration
Figure 3 for Punctuation Restoration
Figure 4 for Punctuation Restoration
Viaarxiv icon

Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework

Add code
Bookmark button
Alert button
Nov 07, 2019
Mingbo Ma, Baigong Zheng, Kaibo Liu, Renjie Zheng, Hairong Liu, Kainan Peng, Kenneth Church, Liang Huang

Figure 1 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Figure 2 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Figure 3 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Figure 4 for Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework
Viaarxiv icon

End-to-End Multi-Channel Speech Separation

May 15, 2019
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu

Figure 1 for End-to-End Multi-Channel Speech Separation
Figure 2 for End-to-End Multi-Channel Speech Separation
Figure 3 for End-to-End Multi-Channel Speech Separation
Figure 4 for End-to-End Multi-Channel Speech Separation
Viaarxiv icon

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

Apr 01, 2022
Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida

Figure 1 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 2 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 3 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 4 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Viaarxiv icon