Alert button

"speech": models, code, and papers
Alert button

Recognizing long-form speech using streaming end-to-end models

Oct 24, 2019
Arun Narayanan, Rohit Prabhavalkar, Chung-Cheng Chiu, David Rybach, Tara N. Sainath, Trevor Strohman

Figure 1 for Recognizing long-form speech using streaming end-to-end models
Figure 2 for Recognizing long-form speech using streaming end-to-end models
Figure 3 for Recognizing long-form speech using streaming end-to-end models
Figure 4 for Recognizing long-form speech using streaming end-to-end models
Viaarxiv icon

Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models

Nov 21, 2019
Zhiyun Lu, Liangliang Cao, Yu Zhang, Chung-Cheng Chiu, James Fan

Figure 1 for Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
Figure 2 for Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
Figure 3 for Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
Figure 4 for Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
Viaarxiv icon

Forecast Evaluation for Data Scientists: Common Pitfalls and Best Practices

Apr 04, 2022
Hansika Hewamalage, Klaus Ackermann, Christoph Bergmeir

Figure 1 for Forecast Evaluation for Data Scientists: Common Pitfalls and Best Practices
Figure 2 for Forecast Evaluation for Data Scientists: Common Pitfalls and Best Practices
Figure 3 for Forecast Evaluation for Data Scientists: Common Pitfalls and Best Practices
Figure 4 for Forecast Evaluation for Data Scientists: Common Pitfalls and Best Practices
Viaarxiv icon

Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application

Add code
Bookmark button
Alert button
Sep 22, 2020
Chris J. Kennedy, Geoff Bacon, Alexander Sahn, Claudia von Vacano

Figure 1 for Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Figure 2 for Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Figure 3 for Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Figure 4 for Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Viaarxiv icon

Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

Add code
Bookmark button
Alert button
Apr 06, 2019
Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte, Yoshua Bengio

Figure 1 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Figure 2 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Figure 3 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Figure 4 for Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
Viaarxiv icon

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Add code
Bookmark button
Alert button
Oct 23, 2019
Eric Battenberg, RJ Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby

Figure 1 for Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Figure 2 for Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Figure 3 for Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Figure 4 for Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Viaarxiv icon

Robust Neural Machine Translation for Clean and Noisy Speech Transcripts

Oct 22, 2019
Mattia Antonino Di Gangi, Robert Enyedi, Alessandra Brusadin, Marcello Federico

Figure 1 for Robust Neural Machine Translation for Clean and Noisy Speech Transcripts
Figure 2 for Robust Neural Machine Translation for Clean and Noisy Speech Transcripts
Figure 3 for Robust Neural Machine Translation for Clean and Noisy Speech Transcripts
Figure 4 for Robust Neural Machine Translation for Clean and Noisy Speech Transcripts
Viaarxiv icon

End-to-End Automatic Speech Translation of Audiobooks

Add code
Bookmark button
Alert button
Feb 12, 2018
Alexandre Bérard, Laurent Besacier, Ali Can Kocabiyikoglu, Olivier Pietquin

Figure 1 for End-to-End Automatic Speech Translation of Audiobooks
Figure 2 for End-to-End Automatic Speech Translation of Audiobooks
Figure 3 for End-to-End Automatic Speech Translation of Audiobooks
Figure 4 for End-to-End Automatic Speech Translation of Audiobooks
Viaarxiv icon

Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation

Jul 03, 2019
Siyuan Feng, Tan Lee

Figure 1 for Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation
Figure 2 for Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation
Figure 3 for Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation
Figure 4 for Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation
Viaarxiv icon

Online Automatic Speech Recognition with Listen, Attend and Spell Model

Aug 12, 2020
Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal

Figure 1 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Figure 2 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Figure 3 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Figure 4 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Viaarxiv icon