Alert button

"speech": models, code, and papers
Alert button

LiRA: Learning Visual Speech Representations from Audio through Self-supervision

Jun 16, 2021
Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Björn W. Schuller, Maja Pantic

Figure 1 for LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Figure 2 for LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Figure 3 for LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Figure 4 for LiRA: Learning Visual Speech Representations from Audio through Self-supervision
Viaarxiv icon

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

Add code
Bookmark button
Alert button
Feb 20, 2021
Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie

Figure 1 for The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
Figure 2 for The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
Figure 3 for The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods
Viaarxiv icon

Dimensions of Interpersonal Dynamics in Text: Group Membership and Fine-grained Interpersonal Emotion

Add code
Bookmark button
Alert button
Sep 14, 2022
Venkata S Govindarajan, Katherine Atwell, Barea Sinno, Malihe Alikhani, David I. Beaver, Junyi Jessy Li

Figure 1 for Dimensions of Interpersonal Dynamics in Text: Group Membership and Fine-grained Interpersonal Emotion
Figure 2 for Dimensions of Interpersonal Dynamics in Text: Group Membership and Fine-grained Interpersonal Emotion
Figure 3 for Dimensions of Interpersonal Dynamics in Text: Group Membership and Fine-grained Interpersonal Emotion
Figure 4 for Dimensions of Interpersonal Dynamics in Text: Group Membership and Fine-grained Interpersonal Emotion
Viaarxiv icon

Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems

Dec 03, 2021
Xiaoliang Wu, Ajitha Rajan

Figure 1 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Figure 2 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Figure 3 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Figure 4 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Viaarxiv icon

Design of a novel Korean learning application for efficient pronunciation correction

May 04, 2022
Minjong Cheon, Minseon Kim, Hanseon Joo

Figure 1 for Design of a novel Korean learning application for efficient pronunciation correction
Figure 2 for Design of a novel Korean learning application for efficient pronunciation correction
Figure 3 for Design of a novel Korean learning application for efficient pronunciation correction
Figure 4 for Design of a novel Korean learning application for efficient pronunciation correction
Viaarxiv icon

Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring

Add code
Bookmark button
Alert button
Sep 09, 2021
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe

Figure 1 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Figure 2 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Figure 3 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Figure 4 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Viaarxiv icon

Thai Wav2Vec2.0 with CommonVoice V8

Add code
Bookmark button
Alert button
Aug 09, 2022
Wannaphong Phatthiyaphaibun, Chompakorn Chaksangchaichot, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong

Figure 1 for Thai Wav2Vec2.0 with CommonVoice V8
Figure 2 for Thai Wav2Vec2.0 with CommonVoice V8
Viaarxiv icon

MLS: A Large-Scale Multilingual Dataset for Speech Research

Add code
Bookmark button
Alert button
Dec 19, 2020
Vineel Pratap, Qiantong Xu, Anuroop Sriram, Gabriel Synnaeve, Ronan Collobert

Figure 1 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 2 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 3 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Figure 4 for MLS: A Large-Scale Multilingual Dataset for Speech Research
Viaarxiv icon

Speech Quality Assessment in Crowdsourcing: Comparison Category Rating Method

Add code
Bookmark button
Alert button
Apr 09, 2021
Babak Naderi, Sebastian Möller, Ross Cutler

Figure 1 for Speech Quality Assessment in Crowdsourcing: Comparison Category Rating Method
Figure 2 for Speech Quality Assessment in Crowdsourcing: Comparison Category Rating Method
Figure 3 for Speech Quality Assessment in Crowdsourcing: Comparison Category Rating Method
Figure 4 for Speech Quality Assessment in Crowdsourcing: Comparison Category Rating Method
Viaarxiv icon

Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks

Oct 29, 2020
Masood S. Mortazavi

Figure 1 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Figure 2 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Figure 3 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Figure 4 for Speech-Image Semantic Alignment Does Not Depend on Any Prior Classification Tasks
Viaarxiv icon