Alert button

"speech": models, code, and papers
Alert button

HAIDA: Biometric technological therapy tools for neurorehabilitation of Cognitive Impairment

Mar 09, 2022
Elsa Fernandez, Jordi Sole-Casals, Pilar M. Calvo, Marcos Faundez-Zanuy, Karmele Lopez-de-Ipina

Figure 1 for HAIDA: Biometric technological therapy tools for neurorehabilitation of Cognitive Impairment
Viaarxiv icon

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

Add code
Bookmark button
Alert button
Apr 07, 2022
Zhao You, Shulin Feng, Dan Su, Dong Yu

Figure 1 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 2 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 3 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 4 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Viaarxiv icon

Do face masks introduce bias in speech technologies? The case of automated scoring of speaking proficiency

Add code
Bookmark button
Alert button
Aug 17, 2020
Anastassia Loukina, Keelan Evanini, Matthew Mulholland, Ian Blood, Klaus Zechner

Figure 1 for Do face masks introduce bias in speech technologies? The case of automated scoring of speaking proficiency
Figure 2 for Do face masks introduce bias in speech technologies? The case of automated scoring of speaking proficiency
Figure 3 for Do face masks introduce bias in speech technologies? The case of automated scoring of speaking proficiency
Viaarxiv icon

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

May 01, 2020
Baiji Liu, Songjun Cao, Sining Sun, Weibin Zhang, Long Ma

Figure 1 for Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Figure 2 for Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Figure 3 for Multi-head Monotonic Chunkwise Attention For Online Speech Recognition
Viaarxiv icon

End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning

Aug 13, 2019
Pavel Denisov, Ngoc Thang Vu

Figure 1 for End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Figure 2 for End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Figure 3 for End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Figure 4 for End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Viaarxiv icon

Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture

Jan 15, 2020
Haoran Miao, Gaofeng Cheng, Changfeng Gao, Pengyuan Zhang, Yonghong Yan

Figure 1 for Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture
Figure 2 for Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture
Figure 3 for Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture
Figure 4 for Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture
Viaarxiv icon

Looking Enhances Listening: Recovering Missing Speech Using Images

Add code
Bookmark button
Alert button
Feb 13, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze

Figure 1 for Looking Enhances Listening: Recovering Missing Speech Using Images
Figure 2 for Looking Enhances Listening: Recovering Missing Speech Using Images
Figure 3 for Looking Enhances Listening: Recovering Missing Speech Using Images
Figure 4 for Looking Enhances Listening: Recovering Missing Speech Using Images
Viaarxiv icon

A Summary of the ComParE COVID-19 Challenges

Add code
Bookmark button
Alert button
Feb 17, 2022
Harry Coppock, Alican Akman, Christian Bergler, Maurice Gerczuk, Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Jing Han, Shahin Amiriparian, Alice Baird, Lukas Stappen, Sandra Ottl, Panagiotis Tzirakis, Anton Batliner, Cecilia Mascolo, Björn W. Schuller

Figure 1 for A Summary of the ComParE COVID-19 Challenges
Figure 2 for A Summary of the ComParE COVID-19 Challenges
Figure 3 for A Summary of the ComParE COVID-19 Challenges
Figure 4 for A Summary of the ComParE COVID-19 Challenges
Viaarxiv icon

A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization

May 10, 2022
Yu-Hsiang Chiang, Tian-Sheuan Chang, Shyh Jye Jou

Figure 1 for A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization
Figure 2 for A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization
Figure 3 for A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization
Figure 4 for A 14uJ/Decision Keyword Spotting Accelerator with In-SRAM-Computing and On Chip Learning for Customization
Viaarxiv icon

Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder

Oct 14, 2019
Cristina Gârbacea, Aäron van den Oord, Yazhe Li, Felicia S C Lim, Alejandro Luebs, Oriol Vinyals, Thomas C Walters

Figure 1 for Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Figure 2 for Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Figure 3 for Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Figure 4 for Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder
Viaarxiv icon