Alert button

"speech": models, code, and papers
Alert button

Towards Visually Grounded Sub-Word Speech Unit Discovery

Feb 21, 2019
David Harwath, James Glass

Figure 1 for Towards Visually Grounded Sub-Word Speech Unit Discovery
Figure 2 for Towards Visually Grounded Sub-Word Speech Unit Discovery
Figure 3 for Towards Visually Grounded Sub-Word Speech Unit Discovery
Figure 4 for Towards Visually Grounded Sub-Word Speech Unit Discovery
Viaarxiv icon

Capitalization and Punctuation Restoration: a Survey

Nov 21, 2021
Vasile Păiş, Dan Tufiş

Figure 1 for Capitalization and Punctuation Restoration: a Survey
Figure 2 for Capitalization and Punctuation Restoration: a Survey
Viaarxiv icon

Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems

May 21, 2019
Ohsung Kwon, Eunwoo Song, Jae-Min Kim, Hong-Goo Kang

Figure 1 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 2 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 3 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Figure 4 for Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems
Viaarxiv icon

Medication Error Detection Using Contextual Language Models

Jan 09, 2022
Yu Jiang, Christian Poellabauer

Figure 1 for Medication Error Detection Using Contextual Language Models
Figure 2 for Medication Error Detection Using Contextual Language Models
Figure 3 for Medication Error Detection Using Contextual Language Models
Figure 4 for Medication Error Detection Using Contextual Language Models
Viaarxiv icon

Single-channel Speech Dereverberation via Generative Adversarial Training

Jun 25, 2018
Chenxing Li, Tieqiang Wang, Shuang Xu, Bo Xu

Figure 1 for Single-channel Speech Dereverberation via Generative Adversarial Training
Figure 2 for Single-channel Speech Dereverberation via Generative Adversarial Training
Figure 3 for Single-channel Speech Dereverberation via Generative Adversarial Training
Figure 4 for Single-channel Speech Dereverberation via Generative Adversarial Training
Viaarxiv icon

Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions

Jul 01, 2019
Manuel Sam Ribeiro, Aciel Eshky, Korin Richmond, Steve Renals

Figure 1 for Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions
Figure 2 for Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions
Figure 3 for Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions
Figure 4 for Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions
Viaarxiv icon

Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation

Apr 15, 2019
Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel

Viaarxiv icon

Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems

Dec 19, 2019
Nick Rossenbach, Albert Zeyer, Ralf Schlüter, Hermann Ney

Figure 1 for Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
Figure 2 for Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
Figure 3 for Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
Figure 4 for Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems
Viaarxiv icon

End-to-end Generative Pretraining for Multimodal Video Captioning

Jan 20, 2022
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid

Figure 1 for End-to-end Generative Pretraining for Multimodal Video Captioning
Figure 2 for End-to-end Generative Pretraining for Multimodal Video Captioning
Figure 3 for End-to-end Generative Pretraining for Multimodal Video Captioning
Figure 4 for End-to-end Generative Pretraining for Multimodal Video Captioning
Viaarxiv icon

A Comparative Study on End-to-end Speech to Text Translation

Nov 20, 2019
Parnia Bahar, Tobias Bieschke, Hermann Ney

Figure 1 for A Comparative Study on End-to-end Speech to Text Translation
Figure 2 for A Comparative Study on End-to-end Speech to Text Translation
Figure 3 for A Comparative Study on End-to-end Speech to Text Translation
Figure 4 for A Comparative Study on End-to-end Speech to Text Translation
Viaarxiv icon