Alert button

"speech": models, code, and papers
Alert button

Automated detection of pronunciation errors in non-native English speech employing deep learning

Sep 13, 2022
Daniel Korzekwa

Viaarxiv icon

Multi-View Attention Transfer for Efficient Speech Enhancement

Aug 22, 2022
Wooseok Shin, Hyun Joon Park, Jin Sob Kim, Byung Hoon Lee, Sung Won Han

Figure 1 for Multi-View Attention Transfer for Efficient Speech Enhancement
Figure 2 for Multi-View Attention Transfer for Efficient Speech Enhancement
Figure 3 for Multi-View Attention Transfer for Efficient Speech Enhancement
Figure 4 for Multi-View Attention Transfer for Efficient Speech Enhancement
Viaarxiv icon

Application of Knowledge Distillation to Multi-task Speech Representation Learning

Oct 29, 2022
Mine Kerpicci, Van Nguyen, Shuhua Zhang, Erik Visser

Figure 1 for Application of Knowledge Distillation to Multi-task Speech Representation Learning
Figure 2 for Application of Knowledge Distillation to Multi-task Speech Representation Learning
Figure 3 for Application of Knowledge Distillation to Multi-task Speech Representation Learning
Figure 4 for Application of Knowledge Distillation to Multi-task Speech Representation Learning
Viaarxiv icon

Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation

Add code
Bookmark button
Alert button
May 18, 2022
Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang

Figure 1 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Figure 2 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Figure 3 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Figure 4 for Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation
Viaarxiv icon

Adversarial Privacy Protection on Speech Enhancement

Add code
Bookmark button
Alert button
Jun 16, 2022
Mingyu Dong, Diqun Yan, Rangding Wang

Figure 1 for Adversarial Privacy Protection on Speech Enhancement
Figure 2 for Adversarial Privacy Protection on Speech Enhancement
Figure 3 for Adversarial Privacy Protection on Speech Enhancement
Figure 4 for Adversarial Privacy Protection on Speech Enhancement
Viaarxiv icon

DDKtor: Automatic Diadochokinetic Speech Analysis

Add code
Bookmark button
Alert button
Jun 29, 2022
Yael Segal, Kasia Hitczenko, Matthew Goldrick, Adam Buchwald, Angela Roberts, Joseph Keshet

Figure 1 for DDKtor: Automatic Diadochokinetic Speech Analysis
Figure 2 for DDKtor: Automatic Diadochokinetic Speech Analysis
Figure 3 for DDKtor: Automatic Diadochokinetic Speech Analysis
Figure 4 for DDKtor: Automatic Diadochokinetic Speech Analysis
Viaarxiv icon

OLISIA: a Cascade System for Spoken Dialogue State Tracking

Add code
Bookmark button
Alert button
Apr 20, 2023
Léo Jacqmin, Lucas Druart, Valentin Vielzeuf, Lina Maria Rojas-Barahona, Yannick Estève, Benoît Favre

Figure 1 for OLISIA: a Cascade System for Spoken Dialogue State Tracking
Figure 2 for OLISIA: a Cascade System for Spoken Dialogue State Tracking
Figure 3 for OLISIA: a Cascade System for Spoken Dialogue State Tracking
Figure 4 for OLISIA: a Cascade System for Spoken Dialogue State Tracking
Viaarxiv icon

From Audio to Symbolic Encoding

Add code
Bookmark button
Alert button
Feb 26, 2023
Shenli Yuan, Lingjie Kong, Jiushuang Guo

Figure 1 for From Audio to Symbolic Encoding
Figure 2 for From Audio to Symbolic Encoding
Figure 3 for From Audio to Symbolic Encoding
Figure 4 for From Audio to Symbolic Encoding
Viaarxiv icon

Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion

Add code
Bookmark button
Alert button
Feb 26, 2023
Jianrong Wang, Jinyu Liu, Li Liu, Xuewei Li, Mei Yu, Jie Gao, Qiang Fang

Figure 1 for Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion
Figure 2 for Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion
Figure 3 for Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion
Figure 4 for Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion
Viaarxiv icon

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers

Jul 05, 2022
Ali Siahkoohi, Michael Chinen, Tom Denton, W. Bastiaan Kleijn, Jan Skoglund

Figure 1 for Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Figure 2 for Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Figure 3 for Ultra-Low-Bitrate Speech Coding with Pretrained Transformers
Viaarxiv icon