Alert button

"speech": models, code, and papers
Alert button

The slurk Interaction Server Framework: Better Data for Better Dialog Models

Add code
Bookmark button
Alert button
Feb 02, 2022
Jana Götze, Maike Paetzel-Prüsmann, Wencke Liermann, Tim Diekmann, David Schlangen

Figure 1 for The slurk Interaction Server Framework: Better Data for Better Dialog Models
Figure 2 for The slurk Interaction Server Framework: Better Data for Better Dialog Models
Figure 3 for The slurk Interaction Server Framework: Better Data for Better Dialog Models
Figure 4 for The slurk Interaction Server Framework: Better Data for Better Dialog Models
Viaarxiv icon

Attention-Free Keyword Spotting

Oct 18, 2021
Mashrur M. Morshed, Ahmad Omar Ahsan

Figure 1 for Attention-Free Keyword Spotting
Figure 2 for Attention-Free Keyword Spotting
Figure 3 for Attention-Free Keyword Spotting
Figure 4 for Attention-Free Keyword Spotting
Viaarxiv icon

Pathological speech detection using x-vector embeddings

Mar 03, 2020
Catarina Botelho, Francisco Teixeira, Thomas Rolland, Alberto Abad, Isabel Trancoso

Figure 1 for Pathological speech detection using x-vector embeddings
Figure 2 for Pathological speech detection using x-vector embeddings
Figure 3 for Pathological speech detection using x-vector embeddings
Figure 4 for Pathological speech detection using x-vector embeddings
Viaarxiv icon

End-to-End Multi-Channel Speech Separation

May 28, 2019
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu

Figure 1 for End-to-End Multi-Channel Speech Separation
Figure 2 for End-to-End Multi-Channel Speech Separation
Figure 3 for End-to-End Multi-Channel Speech Separation
Figure 4 for End-to-End Multi-Channel Speech Separation
Viaarxiv icon

Population Based Training for Data Augmentation and Regularization in Speech Recognition

Oct 08, 2020
Daniel Haziza, Jérémy Rapin, Gabriel Synnaeve

Figure 1 for Population Based Training for Data Augmentation and Regularization in Speech Recognition
Figure 2 for Population Based Training for Data Augmentation and Regularization in Speech Recognition
Figure 3 for Population Based Training for Data Augmentation and Regularization in Speech Recognition
Figure 4 for Population Based Training for Data Augmentation and Regularization in Speech Recognition
Viaarxiv icon

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

Add code
Bookmark button
Alert button
Sep 21, 2021
Badr M. Abdullah, Iuliia Zaitova, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

Figure 1 for How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
Figure 2 for How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
Figure 3 for How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
Figure 4 for How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
Viaarxiv icon

A scalable noisy speech dataset and online subjective test framework

Sep 17, 2019
Chandan K. A. Reddy, Ebrahim Beyrami, Jamie Pool, Ross Cutler, Sriram Srinivasan, Johannes Gehrke

Figure 1 for A scalable noisy speech dataset and online subjective test framework
Figure 2 for A scalable noisy speech dataset and online subjective test framework
Figure 3 for A scalable noisy speech dataset and online subjective test framework
Viaarxiv icon

Unraveling Social Perceptions & Behaviors towards Migrants on Twitter

Dec 04, 2021
Aparup Khatua, Wolfgang Nejdl

Figure 1 for Unraveling Social Perceptions & Behaviors towards Migrants on Twitter
Figure 2 for Unraveling Social Perceptions & Behaviors towards Migrants on Twitter
Figure 3 for Unraveling Social Perceptions & Behaviors towards Migrants on Twitter
Figure 4 for Unraveling Social Perceptions & Behaviors towards Migrants on Twitter
Viaarxiv icon

Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment

Oct 24, 2020
Ethan A. Chi, Julian Salazar, Katrin Kirchhoff

Figure 1 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 2 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 3 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 4 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Viaarxiv icon

How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition

Add code
Bookmark button
Alert button
Apr 17, 2020
George Sterpu, Christian Saam, Naomi Harte

Figure 1 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Figure 2 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Figure 3 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Figure 4 for How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Viaarxiv icon