Alert button

"speech recognition": models, code, and papers
Alert button

Investigations on End-to-End Audiovisual Fusion

Apr 30, 2018
Michael Wand, Ngoc Thang Vu, Juergen Schmidhuber

Figure 1 for Investigations on End-to-End Audiovisual Fusion
Figure 2 for Investigations on End-to-End Audiovisual Fusion
Figure 3 for Investigations on End-to-End Audiovisual Fusion
Figure 4 for Investigations on End-to-End Audiovisual Fusion
Viaarxiv icon

Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration

Mar 04, 2021
Han Li, Sunghyun Park, Aswarth Dara, Jinseok Nam, Sungjin Lee, Young-Bum Kim, Spyros Matsoukas, Ruhi Sarikaya

Figure 1 for Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration
Figure 2 for Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration
Figure 3 for Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration
Figure 4 for Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration
Viaarxiv icon

Iterative Compression of End-to-End ASR Model using AutoML

Aug 06, 2020
Abhinav Mehrotra, Łukasz Dudziak, Jinsu Yeo, Young-yoon Lee, Ravichander Vipperla, Mohamed S. Abdelfattah, Sourav Bhattacharya, Samin Ishtiaq, Alberto Gil C. P. Ramos, SangJeong Lee, Daehyun Kim, Nicholas D. Lane

Figure 1 for Iterative Compression of End-to-End ASR Model using AutoML
Figure 2 for Iterative Compression of End-to-End ASR Model using AutoML
Figure 3 for Iterative Compression of End-to-End ASR Model using AutoML
Figure 4 for Iterative Compression of End-to-End ASR Model using AutoML
Viaarxiv icon

Unsupervised Cross-Domain Singing Voice Conversion

Add code
Bookmark button
Alert button
Aug 06, 2020
Adam Polyak, Lior Wolf, Yossi Adi, Yaniv Taigman

Figure 1 for Unsupervised Cross-Domain Singing Voice Conversion
Figure 2 for Unsupervised Cross-Domain Singing Voice Conversion
Figure 3 for Unsupervised Cross-Domain Singing Voice Conversion
Viaarxiv icon

A New Amharic Speech Emotion Dataset and Classification Benchmark

Add code
Bookmark button
Alert button
Jan 07, 2022
Ephrem A. Retta, Eiad Almekhlafi, Richard Sutcliffe, Mustafa Mhamed, Haider Ali, Jun Feng

Figure 1 for A New Amharic Speech Emotion Dataset and Classification Benchmark
Figure 2 for A New Amharic Speech Emotion Dataset and Classification Benchmark
Figure 3 for A New Amharic Speech Emotion Dataset and Classification Benchmark
Figure 4 for A New Amharic Speech Emotion Dataset and Classification Benchmark
Viaarxiv icon

Detecting Audio Attacks on ASR Systems with Dropout Uncertainty

Jun 02, 2020
Tejas Jayashankar, Jonathan Le Roux, Pierre Moulin

Figure 1 for Detecting Audio Attacks on ASR Systems with Dropout Uncertainty
Figure 2 for Detecting Audio Attacks on ASR Systems with Dropout Uncertainty
Viaarxiv icon

Continuous Speech Separation with Ad Hoc Microphone Arrays

Mar 03, 2021
Dongmei Wang, Takuya Yoshioka, Zhuo Chen, Xiaofei Wang, Tianyan Zhou, Zhong Meng

Figure 1 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Figure 2 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Figure 3 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Figure 4 for Continuous Speech Separation with Ad Hoc Microphone Arrays
Viaarxiv icon

Multi-channel Multi-frame ADL-MVDR for Target Speech Separation

Add code
Bookmark button
Alert button
Dec 24, 2020
Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Donald S. Williamson, Dong Yu

Figure 1 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Figure 2 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Figure 3 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Figure 4 for Multi-channel Multi-frame ADL-MVDR for Target Speech Separation
Viaarxiv icon

Estimation error analysis of deep learning on the regression problem on the variable exponent Besov space

Sep 27, 2020
Kazuma Tsuji, Taiji Suzuki

Figure 1 for Estimation error analysis of deep learning on the regression problem on the variable exponent Besov space
Figure 2 for Estimation error analysis of deep learning on the regression problem on the variable exponent Besov space
Figure 3 for Estimation error analysis of deep learning on the regression problem on the variable exponent Besov space
Figure 4 for Estimation error analysis of deep learning on the regression problem on the variable exponent Besov space
Viaarxiv icon

Cascaded Models With Cyclic Feedback For Direct Speech Translation

Add code
Bookmark button
Alert button
Oct 21, 2020
Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler

Figure 1 for Cascaded Models With Cyclic Feedback For Direct Speech Translation
Figure 2 for Cascaded Models With Cyclic Feedback For Direct Speech Translation
Figure 3 for Cascaded Models With Cyclic Feedback For Direct Speech Translation
Figure 4 for Cascaded Models With Cyclic Feedback For Direct Speech Translation
Viaarxiv icon