"speech": models, code, and papers

SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training

Oct 07, 2022
Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei

Via arXiv

Evaluation of the syllables pronunciation quality in speech rehabilitation through the solution of the classification problem

Jan 25, 2023
Evgeny Kostyuchenko

Via arXiv

Speech MOS multi-task learning and rater bias correction

Dec 04, 2022
Haleh Akrami, Hannes Gamper

Via arXiv

Learning from Invalid Data: On Constraint Satisfaction in Generative Models

Jun 27, 2023
Giorgio Giannone, Lyle Regenwetter, Akash Srivastava, Dan Gutfreund, Faez Ahmed

Via arXiv

Reducing Barriers to Self-Supervised Learning: HuBERT Pre-training with Academic Compute

Jun 11, 2023
William Chen, Xuankai Chang, Yifan Peng, Zhaoheng Ni, Soumi Maiti, Shinji Watanabe

Via arXiv

LEACE: Perfect linear concept erasure in closed form

Jun 06, 2023
Nora Belrose, David Schneider-Joseph, Shauli Ravfogel, Ryan Cotterell, Edward Raff, Stella Biderman

Via arXiv

Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics

Jun 06, 2023
Bo Molenaar, Cristian Tejedor-Garcia, Helmer Strik, Catia Cucchiarini

Via arXiv

Speech Driven Video Editing via an Audio-Conditioned Diffusion Model

Jan 12, 2023
Dan Bigioi, Shubhajit Basak, Hugh Jordan, Rachel McDonnell, Peter Corcoran

Via arXiv

Learning From Yourself: A Self-Distillation Method for Fake Speech Detection

Mar 02, 2023
Jun Xue, Cunhang Fan, Jiangyan Yi, Chenglong Wang, Zhengqi Wen, Dan Zhang, Zhao Lv

Via arXiv

Record Deduplication for Entity Distribution Modeling in ASR Transcripts

Jun 09, 2023
Tianyu Huang, Chung Hoon Hong, Carl Wivagg, Kanna Shimizu

Via arXiv