Alert button

"speech recognition": models, code, and papers
Alert button

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition

Feb 07, 2018
Xuesong Yang, Kartik Audhkhasi, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Mark Hasegawa-Johnson

Figure 1 for Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Figure 2 for Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Figure 3 for Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Figure 4 for Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Viaarxiv icon

Trace norm regularization and faster inference for embedded speech recognition RNNs

Add code
Bookmark button
Alert button
Feb 06, 2018
Markus Kliegl, Siddharth Goyal, Kexin Zhao, Kavya Srinet, Mohammad Shoeybi

Figure 1 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Figure 2 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Figure 3 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Figure 4 for Trace norm regularization and faster inference for embedded speech recognition RNNs
Viaarxiv icon

Cloud-Based Face and Speech Recognition for Access Control Applications

May 08, 2020
Nathalie Tkauc, Thao Tran, Kevin Hernandez-Diaz, Fernando Alonso-Fernandez

Figure 1 for Cloud-Based Face and Speech Recognition for Access Control Applications
Figure 2 for Cloud-Based Face and Speech Recognition for Access Control Applications
Figure 3 for Cloud-Based Face and Speech Recognition for Access Control Applications
Figure 4 for Cloud-Based Face and Speech Recognition for Access Control Applications
Viaarxiv icon

Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

Add code
Bookmark button
Alert button
Jun 09, 2022
Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazım Kemal Ekenel

Figure 1 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 2 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 3 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 4 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Viaarxiv icon

Accelerating recurrent neural network language model based online speech recognition system

Jan 30, 2018
Kyungmin Lee, Chiyoun Park, Namhoon Kim, Jaewon Lee

Figure 1 for Accelerating recurrent neural network language model based online speech recognition system
Figure 2 for Accelerating recurrent neural network language model based online speech recognition system
Figure 3 for Accelerating recurrent neural network language model based online speech recognition system
Figure 4 for Accelerating recurrent neural network language model based online speech recognition system
Viaarxiv icon

Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition

Oct 24, 2019
Zheng Lian, Jianhua Tao, Bin Liu, Jian Huang

Figure 1 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Figure 2 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Figure 3 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Figure 4 for Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Viaarxiv icon

Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator

May 18, 2022
Guangzhi Sun, Chao Zhang, Philip C Woodland

Figure 1 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 2 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 3 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 4 for Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
Viaarxiv icon

Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

Apr 07, 2022
Qijie Shao, Jinghao Yan, Jian Kang, Pengcheng Guo, Xian Shi, Pengfei Hu, Lei Xie

Figure 1 for Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
Figure 2 for Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
Figure 3 for Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
Figure 4 for Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition
Viaarxiv icon

Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model

Add code
Bookmark button
Alert button
Apr 07, 2022
Nick J. C. Wang, Lu Wang, Yandan Sun, Haimei Kang, Dejun Zhang

Figure 1 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 2 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 3 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Figure 4 for Three-Module Modeling For End-to-End Spoken Language Understanding Using Pre-trained DNN-HMM-Based Acoustic-Phonetic Model
Viaarxiv icon

A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition

Dec 01, 2022
Biao Ma, Chengben Xu, Ye Zhang

Figure 1 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Figure 2 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Figure 3 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Figure 4 for A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition
Viaarxiv icon