"speech": models, code, and papers

Can Self-Supervised Neural Networks Pre-Trained on Human Speech distinguish Animal Callers?

May 23, 2023
Eklavya Sarkar, Mathew Magimai.-Doss

Spread Control Method on Unknown Networks Based on Hierarchical Reinforcement Learning

Aug 28, 2023
Wenxiang Dong, H. Vicky Zhao

Turning Whisper into Real-Time Transcription System

Jul 27, 2023
Dominik Macháček, Raj Dabre, Ondřej Bojar

Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing

May 01, 2023
Ibrahim Malik, Siddique Latif, Sanaullah Manzoor, Muhammad Usama, Junaid Qadir, Raja Jurdak

Let's Give a Voice to Conversational Agents in Virtual Reality

Aug 04, 2023
Michele Yin, Gabriel Roccabruna, Abhinav Azad, Giuseppe Riccardi

Developing Social Robots with Empathetic Non-Verbal Cues Using Large Language Models

Aug 31, 2023
Yoon Kyung Lee, Yoonwon Jung, Gyuyi Kang, Sowon Hahn

Rehearsal-Free Online Continual Learning for Automatic Speech Recognition

Jun 19, 2023
Steven Vander Eeckt, Hugo Van hamme

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training

May 27, 2023
Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma

Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering

Jun 06, 2023
Irina-Elena Veliche, Pascale Fung

Speech Intelligibility Classifiers from 550k Disordered Speech Samples

Mar 15, 2023
Subhashini Venugopalan, Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J. N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan Green, Michael P. Brenner
