Alert button

"speech recognition": models, code, and papers
Alert button

Space-Efficient Representation of Entity-centric Query Language Models

Add code
Bookmark button
Alert button
Jun 29, 2022
Christophe Van Gysel, Mirko Hannemann, Ernest Pusateri, Youssef Oualil, Ilya Oparin

Figure 1 for Space-Efficient Representation of Entity-centric Query Language Models
Figure 2 for Space-Efficient Representation of Entity-centric Query Language Models
Figure 3 for Space-Efficient Representation of Entity-centric Query Language Models
Figure 4 for Space-Efficient Representation of Entity-centric Query Language Models
Viaarxiv icon

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition

Jun 05, 2019
Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 2 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 3 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 4 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Viaarxiv icon

Multilingual Speech Recognition with Corpus Relatedness Sampling

Aug 02, 2019
Xinjian Li, Siddharth Dalmia, Alan W. Black, Florian Metze

Figure 1 for Multilingual Speech Recognition with Corpus Relatedness Sampling
Figure 2 for Multilingual Speech Recognition with Corpus Relatedness Sampling
Figure 3 for Multilingual Speech Recognition with Corpus Relatedness Sampling
Figure 4 for Multilingual Speech Recognition with Corpus Relatedness Sampling
Viaarxiv icon

Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion

Mar 13, 2018
Matthijs Van keirsbilck, Bert Moons, Marian Verhelst

Figure 1 for Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion
Figure 2 for Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion
Figure 3 for Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion
Figure 4 for Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion
Viaarxiv icon

Adversarial synthesis based data-augmentation for code-switched spoken language identification

May 30, 2022
Parth Shastri, Chirag Patil, Poorval Wanere, Dr. Shrinivas Mahajan, Dr. Abhishek Bhatt, Dr. Hardik Sailor

Figure 1 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 2 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 3 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 4 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Viaarxiv icon

Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations

May 08, 2018
Prashanth Gurunath Shivakumar, Panayiotis Georgiou

Figure 1 for Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations
Figure 2 for Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations
Figure 3 for Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations
Figure 4 for Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations
Viaarxiv icon

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

Feb 24, 2020
Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung

Figure 1 for Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
Figure 2 for Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
Figure 3 for Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
Figure 4 for Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
Viaarxiv icon

Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots

Aug 27, 2018
Moa Lee, Joon Hyuk Chang

Figure 1 for Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots
Figure 2 for Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots
Figure 3 for Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots
Figure 4 for Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots
Viaarxiv icon

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment

Add code
Bookmark button
Alert button
May 06, 2022
Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James Glass

Figure 1 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 2 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 3 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 4 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Viaarxiv icon

Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition

Jan 22, 2016
Amr Bakry, Ahmed Elgammal

Figure 1 for Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition
Figure 2 for Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition
Figure 3 for Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition
Figure 4 for Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition
Viaarxiv icon