Alert button

"speech recognition": models, code, and papers
Alert button

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization

Nov 29, 2021
Brian Yan, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Siddharth Dalmia, Dan Berrebbi, Chao Weng, Shinji Watanabe, Dong Yu

Figure 1 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 2 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 3 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Figure 4 for Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Viaarxiv icon

A study on cross-corpus speech emotion recognition and data augmentation

Add code
Bookmark button
Alert button
Jan 10, 2022
Norbert Braunschweiler, Rama Doddipatla, Simon Keizer, Svetlana Stoyanchev

Figure 1 for A study on cross-corpus speech emotion recognition and data augmentation
Figure 2 for A study on cross-corpus speech emotion recognition and data augmentation
Figure 3 for A study on cross-corpus speech emotion recognition and data augmentation
Figure 4 for A study on cross-corpus speech emotion recognition and data augmentation
Viaarxiv icon

Speaker Attentive Speech Emotion Recognition

Apr 15, 2021
Clément Le Moine, Nicolas Obin, Axel Roebel

Figure 1 for Speaker Attentive Speech Emotion Recognition
Figure 2 for Speaker Attentive Speech Emotion Recognition
Figure 3 for Speaker Attentive Speech Emotion Recognition
Figure 4 for Speaker Attentive Speech Emotion Recognition
Viaarxiv icon

MobiVSR: A Visual Speech Recognition Solution for Mobile Devices

Jun 05, 2019
Nilay Shrivastava, Astitwa Saxena, Yaman Kumar, Rajiv Ratn Shah, Debanjan Mahata, Amanda Stent

Figure 1 for MobiVSR: A Visual Speech Recognition Solution for Mobile Devices
Figure 2 for MobiVSR: A Visual Speech Recognition Solution for Mobile Devices
Figure 3 for MobiVSR: A Visual Speech Recognition Solution for Mobile Devices
Figure 4 for MobiVSR: A Visual Speech Recognition Solution for Mobile Devices
Viaarxiv icon

Distilling a Pretrained Language Model to a Multilingual ASR Model

Add code
Bookmark button
Alert button
Jun 25, 2022
Kwanghee Choi, Hyung-Min Park

Figure 1 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 2 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 3 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 4 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Viaarxiv icon

Detecting Adversarial Examples in Batches -- a geometrical approach

Add code
Bookmark button
Alert button
Jun 17, 2022
Danush Kumar Venkatesh, Peter Steinbach

Figure 1 for Detecting Adversarial Examples in Batches -- a geometrical approach
Figure 2 for Detecting Adversarial Examples in Batches -- a geometrical approach
Figure 3 for Detecting Adversarial Examples in Batches -- a geometrical approach
Figure 4 for Detecting Adversarial Examples in Batches -- a geometrical approach
Viaarxiv icon

HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions

Add code
Bookmark button
Alert button
Sep 18, 2022
Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Christopher Ré, Matei Zaharia, James Zou

Figure 1 for HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions
Figure 2 for HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions
Figure 3 for HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions
Figure 4 for HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions
Viaarxiv icon

Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer

Oct 30, 2019
Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Pascale Fung

Figure 1 for Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer
Figure 2 for Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer
Figure 3 for Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer
Figure 4 for Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer
Viaarxiv icon

End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system

Feb 18, 2022
Zhengyi Zhang, Pan Zhou

Figure 1 for End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system
Figure 2 for End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system
Figure 3 for End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system
Figure 4 for End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system
Viaarxiv icon

Streaming End-to-end Speech Recognition For Mobile Devices

Add code
Bookmark button
Alert button
Nov 15, 2018
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-yiin Chang, Kanishka Rao, Alexander Gruenstein

Figure 1 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 2 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 3 for Streaming End-to-end Speech Recognition For Mobile Devices
Figure 4 for Streaming End-to-end Speech Recognition For Mobile Devices
Viaarxiv icon