Alert button
Picture for Yerbolat Khassanov

Yerbolat Khassanov

Alert button

SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams

Add code
Bookmark button
Alert button
Dec 05, 2020
Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, Huseyin Atakan Varol

Figure 1 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Figure 2 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Figure 3 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Figure 4 for SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
Viaarxiv icon

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Add code
Bookmark button
Alert button
Sep 22, 2020
Yerbolat Khassanov, Saida Mussakhojayeva, Almas Mirzakhmetov, Alen Adiyev, Mukhamet Nurpeiissov, Huseyin Atakan Varol

Figure 1 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 2 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 3 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 4 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Viaarxiv icon

Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning

Add code
Bookmark button
Alert button
May 28, 2020
Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma

Figure 1 for Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Figure 2 for Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Figure 3 for Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Figure 4 for Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Viaarxiv icon

Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems

Add code
Bookmark button
Alert button
May 18, 2020
Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng

Figure 1 for Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems
Figure 2 for Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems
Figure 3 for Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems
Viaarxiv icon

Independent language modeling architecture for end-to-end ASR

Add code
Bookmark button
Alert button
Nov 25, 2019
Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li

Figure 1 for Independent language modeling architecture for end-to-end ASR
Figure 2 for Independent language modeling architecture for end-to-end ASR
Figure 3 for Independent language modeling architecture for end-to-end ASR
Figure 4 for Independent language modeling architecture for end-to-end ASR
Viaarxiv icon

Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data

Add code
Bookmark button
Alert button
Apr 08, 2019
Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma

Figure 1 for Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
Figure 2 for Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
Figure 3 for Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
Figure 4 for Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data
Viaarxiv icon

Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation

Add code
Bookmark button
Alert button
Apr 08, 2019
Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng

Figure 1 for Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation
Figure 2 for Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation
Figure 3 for Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation
Figure 4 for Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation
Viaarxiv icon

On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition

Add code
Bookmark button
Alert button
Nov 01, 2018
Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li

Figure 1 for On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Figure 2 for On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Figure 3 for On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Figure 4 for On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition
Viaarxiv icon

Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR

Add code
Bookmark button
Alert button
Jun 27, 2018
Yerbolat Khassanov, Eng Siong Chng

Figure 1 for Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
Figure 2 for Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
Figure 3 for Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
Viaarxiv icon