Alan W Black

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution

Oct 27, 2022
Via arXiv

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models

Oct 27, 2022
Via arXiv

CTC Alignments Improve Autoregressive Translation

Oct 11, 2022
Via arXiv

Deep Speech Synthesis from Articulatory Representations

Sep 13, 2022
Via arXiv

ASR2K: Speech Recognition for Around 2000 Languages without Audio

Sep 06, 2022
Via arXiv

Building African Voices

Jul 01, 2022
Via arXiv

On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization

May 24, 2022
Via arXiv

Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition

Apr 08, 2022
Via arXiv

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

Nov 29, 2021
Via arXiv

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Nov 02, 2021
Via arXiv