Alert button

"speech recognition": models, code, and papers
Alert button

Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings

Aug 25, 2022
Chunyan Zeng, Shixiong Feng, Zhifeng Wang, Xiangkui Wan, Yunfan Chen, Nan Zhao

Figure 1 for Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings
Figure 2 for Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings
Figure 3 for Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings
Figure 4 for Spatio-Temporal Representation Learning Enhanced Source Cell-phone Recognition from Speech Recordings
Viaarxiv icon

Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification

Add code
Bookmark button
Alert button
Feb 15, 2021
Bidisha Sharma, Maulik Madhavi, Haizhou Li

Figure 1 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Figure 2 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Figure 3 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Figure 4 for Leveraging Acoustic and Linguistic Embeddings from Pretrained speech and language Models for Intent Classification
Viaarxiv icon

Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora

Sep 23, 2021
Szu-Jui Chen, Wei Xia, John H. L. Hansen

Figure 1 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Figure 2 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Figure 3 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Figure 4 for Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora
Viaarxiv icon

Learning Transferable Features for Speech Emotion Recognition

Dec 23, 2019
Alison Marczewski, Adriano Veloso, Nívio Ziviani

Figure 1 for Learning Transferable Features for Speech Emotion Recognition
Figure 2 for Learning Transferable Features for Speech Emotion Recognition
Figure 3 for Learning Transferable Features for Speech Emotion Recognition
Figure 4 for Learning Transferable Features for Speech Emotion Recognition
Viaarxiv icon

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation

Add code
Bookmark button
Alert button
Aug 09, 2021
Minghan Wang, Yuxia Wang, Chang Su, Jiaxin Guo, Yingtao Zhang, Yujia Liu, Min Zhang, Shimin Tao, Xingshan Zeng, Liangyou Li, Hao Yang, Ying Qin

Figure 1 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Figure 2 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Figure 3 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Figure 4 for The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation
Viaarxiv icon

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Aug 04, 2021
Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli

Figure 1 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 2 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 3 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Figure 4 for Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Viaarxiv icon

Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition

Aug 17, 2013
Md. Ali Hossain, Md. Mijanur Rahman, Uzzal Kumar Prodhan, Md. Farukuzzaman Khan

Figure 1 for Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition
Figure 2 for Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition
Figure 3 for Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition
Figure 4 for Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition
Viaarxiv icon

Sequence-level self-learning with multiple hypotheses

Dec 10, 2021
Kenichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur, Robert Gmyr, Sefik Emre Eskimez, Jinyu Li, Michael Zeng

Figure 1 for Sequence-level self-learning with multiple hypotheses
Figure 2 for Sequence-level self-learning with multiple hypotheses
Figure 3 for Sequence-level self-learning with multiple hypotheses
Figure 4 for Sequence-level self-learning with multiple hypotheses
Viaarxiv icon

An empirical assessment of deep learning approaches to task-oriented dialog management

Aug 07, 2021
Lukáš Matějů, David Griol, Zoraida Callejas, José Manuel Molina, Araceli Sanchis

Viaarxiv icon

A.I. based Embedded Speech to Text Using Deepspeech

Feb 25, 2020
Muhammad Hafidh Firmansyah, Anand Paul, Deblina Bhattacharya, Gul Malik Urfa

Figure 1 for A.I. based Embedded Speech to Text Using Deepspeech
Figure 2 for A.I. based Embedded Speech to Text Using Deepspeech
Figure 3 for A.I. based Embedded Speech to Text Using Deepspeech
Figure 4 for A.I. based Embedded Speech to Text Using Deepspeech
Viaarxiv icon