"speech": models, code, and papers

Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training

Nov 26, 2020
Sameer Khurana, Niko Moritz, Takaaki Hori, Jonathan Le Roux

Direction of Arrival Estimation of Noisy Speech Using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals

Feb 19, 2021
Nils Poschadel, Robert Hupke, Stephan Preihs, Jürgen Peissig

North America Bixby Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2021

Sep 28, 2021
Myungjong Kim, Taeyeon Ki, Aviral Anshu, Vijendra Raj Apsingekar

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Jul 21, 2021
Yinghao Aaron Li, Ali Zare, Nima Mesgarani

Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model

Feb 22, 2021
Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Sefik Eskimez, Liyang Lu, Hong Qu, Michael Zeng

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

May 20, 2020
Dongwei Jiang, Wubo Li, Ruixiong Zhang, Miao Cao, Ne Luo, Yang Han, Wei Zou, Xiangang Li

Towards speech-to-text translation without speech recognition

Feb 13, 2017
Sameer Bansal, Herman Kamper, Adam Lopez, Sharon Goldwater

ADIMA: Abuse Detection In Multilingual Audio

Feb 16, 2022
Vikram Gupta, Rini Sharon, Ramit Sawhney, Debdoot Mukherjee

BBS-KWS:The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge

Dec 03, 2021
Yuting Yang, Binbin Du, Yingxin Zhang, Wenxuan Wang, Yuke Li

ESPnet: End-to-End Speech Processing Toolkit

Mar 30, 2018
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai
