PyTorch-Kaldi


PyTorch-Kaldi is a speech recognition toolkit that combines PyTorch and Kaldi to build hybrid DNN-HMM systems: PyTorch handles the neural acoustic model, while Kaldi handles feature extraction, label computation, and decoding.

ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration

Sep 14, 2024
Via arXiv

Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability

Jul 01, 2021
Via arXiv

A Parallelizable Lattice Rescoring Strategy with Neural Language Models

Mar 08, 2021
Via arXiv

Lhotse: a speech data representation library for the modern deep learning ecosystem

Oct 25, 2021
Via arXiv

PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR

May 20, 2020
Via arXiv

AP20-OLR Challenge: Three Tasks and Their Baselines

Jun 04, 2020
Via arXiv

PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch

Jul 30, 2019
Via arXiv

The PyTorch-Kaldi Speech Recognition Toolkit

Nov 19, 2018
Via arXiv

ESPnet: End-to-End Speech Processing Toolkit

Mar 30, 2018
Via arXiv