"speech recognition": models, code, and papers

Multimodal Speech Recognition for Language-Guided Embodied Agents

Feb 27, 2023
Allen Chang, Xiaoyuan Zhu, Aarav Monga, Seoho Ahn, Tejas Srinivasan, Jesse Thomason

Capturing Spectral and Long-term Contextual Information for Speech Emotion Recognition Using Deep Learning Techniques

Aug 04, 2023
Samiul Islam, Md. Maksudul Haque, Abu Jobayer Md. Sadat

Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes

May 12, 2023
Emma O'Neill, Julie Carson-Berndsen

I3D: Transformer architectures with input-dependent dynamic depth for speech recognition

Mar 14, 2023
Yifan Peng, Jaesong Lee, Shinji Watanabe

Enhancing multilingual speech recognition in air traffic control by sentence-level language identification

Apr 29, 2023
Peng Fan, Dongyue Guo, JianWei Zhang, Bo Yang, Yi Lin

Political corpus creation through automatic speech recognition on EU debates

Apr 17, 2023
Hugo de Vos, Suzan Verberne

A Novel Method for improving accuracy in neural network by reinstating traditional back propagation technique

Aug 09, 2023
Gokulprasath R

NAaLoss: Rethinking the objective of speech enhancement

Aug 24, 2023
Kuan-Hsun Ho, En-Lun Yu, Jeih-weih Hung, Berlin Chen

From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition

Jan 19, 2023
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman

Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation

May 19, 2023
Martijn Bartelds, Nay San, Bradley McDonnell, Dan Jurafsky, Martijn Wieling
