Picture for Jan "Honza'' Černocký

Jan "Honza'' Černocký

Speaker adaptation for Wav2vec2 based dysarthric ASR

Add code
Apr 02, 2022
Figure 1 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 2 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 3 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 4 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Viaarxiv icon

Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries

Add code
Mar 29, 2022
Figure 1 for Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Figure 2 for Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Figure 3 for Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries
Viaarxiv icon

EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition

Add code
Apr 13, 2021
Figure 1 for EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition
Figure 2 for EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition
Figure 3 for EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition
Figure 4 for EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition
Viaarxiv icon

BUT Opensat 2019 Speech Recognition System

Add code
Jan 30, 2020
Figure 1 for BUT Opensat 2019 Speech Recognition System
Figure 2 for BUT Opensat 2019 Speech Recognition System
Figure 3 for BUT Opensat 2019 Speech Recognition System
Figure 4 for BUT Opensat 2019 Speech Recognition System
Viaarxiv icon

A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database

Add code
Dec 08, 2019
Figure 1 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 2 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 3 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 4 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Viaarxiv icon

Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge

Add code
Jul 13, 2019
Figure 1 for Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Figure 2 for Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Figure 3 for Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Figure 4 for Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Viaarxiv icon

Analysis of Multilingual Sequence-to-Sequence speech recognition systems

Add code
Nov 07, 2018
Figure 1 for Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Figure 2 for Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Figure 3 for Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Figure 4 for Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Viaarxiv icon