Picture for Alan W Black

Alan W Black

SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Add code
Oct 16, 2023
Figure 1 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 2 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 3 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 4 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Viaarxiv icon

Self-Supervised Models of Speech Infer Universal Articulatory Kinematics

Add code
Oct 16, 2023
Viaarxiv icon

Deep Speech Synthesis from MRI-Based Articulatory Representations

Add code
Jul 05, 2023
Figure 1 for Deep Speech Synthesis from MRI-Based Articulatory Representations
Figure 2 for Deep Speech Synthesis from MRI-Based Articulatory Representations
Figure 3 for Deep Speech Synthesis from MRI-Based Articulatory Representations
Figure 4 for Deep Speech Synthesis from MRI-Based Articulatory Representations
Viaarxiv icon

Speaker-Independent Acoustic-to-Articulatory Speech Inversion

Add code
Feb 14, 2023
Figure 1 for Speaker-Independent Acoustic-to-Articulatory Speech Inversion
Figure 2 for Speaker-Independent Acoustic-to-Articulatory Speech Inversion
Figure 3 for Speaker-Independent Acoustic-to-Articulatory Speech Inversion
Figure 4 for Speaker-Independent Acoustic-to-Articulatory Speech Inversion
Viaarxiv icon

Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization

Add code
Oct 29, 2022
Figure 1 for Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization
Figure 2 for Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization
Figure 3 for Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization
Viaarxiv icon

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models

Add code
Oct 27, 2022
Figure 1 for Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models
Figure 2 for Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models
Figure 3 for Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models
Figure 4 for Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models
Viaarxiv icon

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution

Add code
Oct 27, 2022
Figure 1 for A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution
Figure 2 for A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution
Figure 3 for A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution
Figure 4 for A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution
Viaarxiv icon

CTC Alignments Improve Autoregressive Translation

Add code
Oct 11, 2022
Figure 1 for CTC Alignments Improve Autoregressive Translation
Figure 2 for CTC Alignments Improve Autoregressive Translation
Figure 3 for CTC Alignments Improve Autoregressive Translation
Figure 4 for CTC Alignments Improve Autoregressive Translation
Viaarxiv icon

Deep Speech Synthesis from Articulatory Representations

Add code
Sep 13, 2022
Figure 1 for Deep Speech Synthesis from Articulatory Representations
Figure 2 for Deep Speech Synthesis from Articulatory Representations
Figure 3 for Deep Speech Synthesis from Articulatory Representations
Figure 4 for Deep Speech Synthesis from Articulatory Representations
Viaarxiv icon

ASR2K: Speech Recognition for Around 2000 Languages without Audio

Add code
Sep 06, 2022
Figure 1 for ASR2K: Speech Recognition for Around 2000 Languages without Audio
Figure 2 for ASR2K: Speech Recognition for Around 2000 Languages without Audio
Figure 3 for ASR2K: Speech Recognition for Around 2000 Languages without Audio
Figure 4 for ASR2K: Speech Recognition for Around 2000 Languages without Audio
Viaarxiv icon