Yerbolat Khassanov

Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration

May 25, 2023
Rustem Yeshpanov, Saida Mussakhojayeva, Yerbolat Khassanov

Improving short-video speech recognition using random utterance concatenation

Oct 28, 2022
Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Yist Lin, Tao Han, Tze Yuan Chong, Yi He, Zejun Ma

KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics

Jan 15, 2022
Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol

KazNERD: Kazakh Named Entity Recognition Dataset

Nov 26, 2021
Rustem Yeshpanov, Yerbolat Khassanov, Huseyin Atakan Varol

A Study of Multimodal Person Verification Using Audio-Visual-Thermal Data

Oct 23, 2021
Madina Abdrakhmanova, Saniya Abushakimova, Yerbolat Khassanov, Huseyin Atakan Varol

A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English

Aug 03, 2021
Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol

USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments

Jul 30, 2021
Muhammadjon Musaev, Saida Mussakhojayeva, Ilyos Khujayorov, Yerbolat Khassanov, Mannon Ochilov, Huseyin Atakan Varol

KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset

Apr 26, 2021
Saida Mussakhojayeva, Aigerim Janaliyeva, Almas Mirzakhmetov, Yerbolat Khassanov, Huseyin Atakan Varol

SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams

Dec 18, 2020
Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, Huseyin Atakan Varol
