Tom Bagby

LAST: Scalable Lattice-Based Speech Modelling in JAX

Apr 25, 2023

Learning the joint distribution of two sequences using little or no paired data

Dec 06, 2022

Speaker Generation

Nov 07, 2021

Non-saturating GAN training as divergence minimization

Oct 15, 2020

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Oct 23, 2019

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

Oct 03, 2019

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Jul 09, 2019

Complex Evolution Recurrent Neural Networks (ceRNNs)

Jun 05, 2019

Streaming End-to-end Speech Recognition For Mobile Devices

Nov 15, 2018