"speech": models, code, and papers

Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks

Jun 24, 2022
Ahmet M. Elbir, Wei Shi, Kumar Vijay Mishra, Anastasios K. Papazafeiropoulos, Symeon Chatzinotas

Via arXiv

Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model

Sep 02, 2022
Jennifer Drexler Fox, Natalie Delworth

Via arXiv

Dimensions of Interpersonal Dynamics in Text: Group Membership and Fine-grained Interpersonal Emotion

Sep 14, 2022
Venkata S Govindarajan, Katherine Atwell, Barea Sinno, Malihe Alikhani, David I. Beaver, Junyi Jessy Li

Via arXiv

The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021

Jul 01, 2021
Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai

Via arXiv

Thai Wav2Vec2.0 with CommonVoice V8

Aug 09, 2022
Wannaphong Phatthiyaphaibun, Chompakorn Chaksangchaichot, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong

Via arXiv

Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation

May 08, 2021
Sunwoo Kim, Minje Kim

Via arXiv

Design of a novel Korean learning application for efficient pronunciation correction

May 04, 2022
Minjong Cheon, Minseon Kim, Hanseon Joo

Via arXiv

LiRA: Learning Visual Speech Representations from Audio through Self-supervision

Jun 16, 2021
Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Björn W. Schuller, Maja Pantic

Via arXiv

SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

Apr 02, 2021
Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frederico Santos de Oliveira, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Maria Aluisio, Moacir Antonelli Ponti

Via arXiv

Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems

Dec 03, 2021
Xiaoliang Wu, Ajitha Rajan

Via arXiv