Alert button

"speech recognition": models, code, and papers
Alert button

Leveraging neural representations for facilitating access to untranscribed speech from endangered languages

Add code
Bookmark button
Alert button
Mar 26, 2021
Nay San, Martijn Bartelds, Mitchell Browne, Lily Clifford, Fiona Gibson, John Mansfield, David Nash, Jane Simpson, Myfany Turpin, Maria Vollmer, Sasha Wilmoth, Dan Jurafsky

Figure 1 for Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
Figure 2 for Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
Figure 3 for Leveraging neural representations for facilitating access to untranscribed speech from endangered languages
Viaarxiv icon

Mutually-Constrained Monotonic Multihead Attention for Online ASR

Add code
Bookmark button
Alert button
Mar 26, 2021
Jaeyun Song, Hajin Shim, Eunho Yang

Figure 1 for Mutually-Constrained Monotonic Multihead Attention for Online ASR
Figure 2 for Mutually-Constrained Monotonic Multihead Attention for Online ASR
Figure 3 for Mutually-Constrained Monotonic Multihead Attention for Online ASR
Viaarxiv icon

A Comparative Study of Self-supervised Speech Representation Based Voice Conversion

Add code
Bookmark button
Alert button
Jul 10, 2022
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda

Figure 1 for A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Figure 2 for A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Figure 3 for A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Figure 4 for A Comparative Study of Self-supervised Speech Representation Based Voice Conversion
Viaarxiv icon

CoVoST 2 and Massively Multilingual Speech-to-Text Translation

Add code
Bookmark button
Alert button
Aug 20, 2020
Changhan Wang, Anne Wu, Juan Pino

Figure 1 for CoVoST 2 and Massively Multilingual Speech-to-Text Translation
Figure 2 for CoVoST 2 and Massively Multilingual Speech-to-Text Translation
Figure 3 for CoVoST 2 and Massively Multilingual Speech-to-Text Translation
Figure 4 for CoVoST 2 and Massively Multilingual Speech-to-Text Translation
Viaarxiv icon

Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks

Jun 24, 2022
Ahmet M. Elbir, Wei Shi, Kumar Vijay Mishra, Anastasios K. Papazafeiropoulos, Symeon Chatzinotas

Figure 1 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Figure 2 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Figure 3 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Figure 4 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Viaarxiv icon

Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models

Apr 25, 2021
Thibault Doutre, Wei Han, Chung-Cheng Chiu, Ruoming Pang, Olivier Siohan, Liangliang Cao

Figure 1 for Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models
Figure 2 for Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models
Figure 3 for Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models
Figure 4 for Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models
Viaarxiv icon

Design and Optimization of a Speech Recognition Front-End for Distant-Talking Control of a Music Playback Device

May 05, 2014
Ramin Pichevar, Jason Wung, Daniele Giacobello, Joshua Atkins

Figure 1 for Design and Optimization of a Speech Recognition Front-End for Distant-Talking Control of a Music Playback Device
Figure 2 for Design and Optimization of a Speech Recognition Front-End for Distant-Talking Control of a Music Playback Device
Figure 3 for Design and Optimization of a Speech Recognition Front-End for Distant-Talking Control of a Music Playback Device
Figure 4 for Design and Optimization of a Speech Recognition Front-End for Distant-Talking Control of a Music Playback Device
Viaarxiv icon

Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain

Add code
Bookmark button
Alert button
Jun 16, 2021
Pengcheng Guo, Xuankai Chang, Shinji Watanabe, Lei Xie

Figure 1 for Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
Figure 2 for Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
Figure 3 for Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
Figure 4 for Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
Viaarxiv icon

Learning Discriminative features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition

Jun 19, 2019
Suraj Tripathi, Abhiram Ramesh, Abhay Kumar, Chirag Singh, Promod Yenigalla

Figure 1 for Learning Discriminative features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition
Figure 2 for Learning Discriminative features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition
Figure 3 for Learning Discriminative features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition
Figure 4 for Learning Discriminative features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition
Viaarxiv icon

SDST: Successive Decoding for Speech-to-text Translation

Add code
Bookmark button
Alert button
Sep 21, 2020
Qianqian Dong, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li

Figure 1 for SDST: Successive Decoding for Speech-to-text Translation
Figure 2 for SDST: Successive Decoding for Speech-to-text Translation
Figure 3 for SDST: Successive Decoding for Speech-to-text Translation
Figure 4 for SDST: Successive Decoding for Speech-to-text Translation
Viaarxiv icon