Alert button

"speech": models, code, and papers
Alert button

"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations

Add code
Bookmark button
Alert button
Sep 28, 2021
Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Behnam Hedayatnia, Dilek Hakkani-Tur

Figure 1 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Figure 2 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Figure 3 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Figure 4 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Viaarxiv icon

AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy

Apr 18, 2022
Raphael Petegrosso, Vasistakrishna Baderdinni, Thibaud Senechal, Benjamin L. Bullough

Figure 1 for AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy
Figure 2 for AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy
Figure 3 for AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy
Figure 4 for AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy
Viaarxiv icon

Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation

Jul 04, 2021
Ryo Masumura, Daiki Okamura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi

Figure 1 for Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation
Figure 2 for Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation
Viaarxiv icon

Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue

Apr 18, 2022
Jiudong Yang, Peiying Wang, Yi Zhu, Mingchao Feng, Meng Chen, Xiaodong He

Figure 1 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Figure 2 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Figure 3 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Figure 4 for Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue
Viaarxiv icon

Robust End-to-end Speaker Diarization with Generic Neural Clustering

Apr 18, 2022
Chenyu Yang, Yu Wang

Figure 1 for Robust End-to-end Speaker Diarization with Generic Neural Clustering
Figure 2 for Robust End-to-end Speaker Diarization with Generic Neural Clustering
Figure 3 for Robust End-to-end Speaker Diarization with Generic Neural Clustering
Figure 4 for Robust End-to-end Speaker Diarization with Generic Neural Clustering
Viaarxiv icon

ESNI: Domestic Robots Design for Elderly and Disabled People

Mar 30, 2022
Junchi Chu, Xueyun Tang

Figure 1 for ESNI: Domestic Robots Design for Elderly and Disabled People
Figure 2 for ESNI: Domestic Robots Design for Elderly and Disabled People
Figure 3 for ESNI: Domestic Robots Design for Elderly and Disabled People
Figure 4 for ESNI: Domestic Robots Design for Elderly and Disabled People
Viaarxiv icon

Learning to detect dysarthria from raw speech

Add code
Bookmark button
Alert button
Nov 27, 2018
Juliette Millet, Neil Zeghidour

Figure 1 for Learning to detect dysarthria from raw speech
Figure 2 for Learning to detect dysarthria from raw speech
Figure 3 for Learning to detect dysarthria from raw speech
Figure 4 for Learning to detect dysarthria from raw speech
Viaarxiv icon

AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge

Feb 19, 2021
Houjun Huang, Xu Xiang, Yexin Yang, Rao Ma, Yanmin Qian

Figure 1 for AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge
Figure 2 for AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge
Figure 3 for AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge
Figure 4 for AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge
Viaarxiv icon

Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders

Add code
Bookmark button
Alert button
Nov 10, 2019
Mostafa Sadeghi, Xavier Alameda-Pineda

Figure 1 for Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
Viaarxiv icon

Short-Term Word-Learning in a Dynamically Changing Environment

Add code
Bookmark button
Alert button
Mar 29, 2022
Christian Huber, Rishu Kumar, Ondřej Bojar, Alexander Waibel

Figure 1 for Short-Term Word-Learning in a Dynamically Changing Environment
Figure 2 for Short-Term Word-Learning in a Dynamically Changing Environment
Figure 3 for Short-Term Word-Learning in a Dynamically Changing Environment
Figure 4 for Short-Term Word-Learning in a Dynamically Changing Environment
Viaarxiv icon