"speech recognition": models, code, and papers

Minimizing Sequential Confusion Error in Speech Command Recognition

Jul 04, 2022
Zhanheng Yang, Hang Lv, Xiong Wang, Ao Zhang, Lei Xie

Via arXiv

An Overview on Language Models: Recent Developments and Outlook

Mar 10, 2023
Chengwei Wei, Yun-Cheng Wang, Bin Wang, C.-C. Jay Kuo

Via arXiv

Continuous Speech Recognition using EEG and Video

Dec 27, 2019
Gautam Krishna, Mason Carnahan, Co Tran, Ahmed H Tewfik

Via arXiv

Towards the Universal Defense for Query-Based Audio Adversarial Attacks

Apr 20, 2023
Feng Guo, Zheng Sun, Yuxuan Chen, Lei Ju

Via arXiv

Flowchase: a Mobile Application for Pronunciation Training

Jul 05, 2023
Noé Tits, Zoé Broisson

Via arXiv

Visual Speech Recognition for Multiple Languages in the Wild

Feb 26, 2022
Pingchuan Ma, Stavros Petridis, Maja Pantic

Via arXiv

Modulation spectral features for speech emotion recognition using deep neural networks

Jan 14, 2023
Premjeet Singh, Md Sahidullah, Goutam Saha

Via arXiv

Calibrating Transformers via Sparse Gaussian Processes

Mar 04, 2023
Wenlong Chen, Yingzhen Li

Via arXiv