Alert button

"speech recognition": models, code, and papers
Alert button

ESPnet-ST IWSLT 2021 Offline Speech Translation System

Add code
Bookmark button
Alert button
Jul 06, 2021
Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe

Figure 1 for ESPnet-ST IWSLT 2021 Offline Speech Translation System
Figure 2 for ESPnet-ST IWSLT 2021 Offline Speech Translation System
Figure 3 for ESPnet-ST IWSLT 2021 Offline Speech Translation System
Figure 4 for ESPnet-ST IWSLT 2021 Offline Speech Translation System
Viaarxiv icon

Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models

Add code
Bookmark button
Alert button
Jun 24, 2022
Hang Ji, Tanvina Patel, Odette Scharenborg

Figure 1 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Figure 2 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Figure 3 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Figure 4 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Viaarxiv icon

Improving a neural network model by explanation-guided training for glioma classification based on MRI data

Jul 05, 2021
Frantisek Sefcik, Wanda Benesova

Figure 1 for Improving a neural network model by explanation-guided training for glioma classification based on MRI data
Figure 2 for Improving a neural network model by explanation-guided training for glioma classification based on MRI data
Figure 3 for Improving a neural network model by explanation-guided training for glioma classification based on MRI data
Figure 4 for Improving a neural network model by explanation-guided training for glioma classification based on MRI data
Viaarxiv icon

Auxiliary Sequence Labeling Tasks for Disfluency Detection

Add code
Bookmark button
Alert button
Oct 24, 2020
Dongyub Lee, Byeongil Ko, Myeong Cheol Shin, Taesun Whang, Daniel Lee, Eun Hwa Kim, EungGyun Kim, Jaechoon Jo

Figure 1 for Auxiliary Sequence Labeling Tasks for Disfluency Detection
Figure 2 for Auxiliary Sequence Labeling Tasks for Disfluency Detection
Figure 3 for Auxiliary Sequence Labeling Tasks for Disfluency Detection
Figure 4 for Auxiliary Sequence Labeling Tasks for Disfluency Detection
Viaarxiv icon

Graph Neural Networks: Methods, Applications, and Opportunities

Aug 24, 2021
Lilapati Waikhom, Ripon Patgiri

Figure 1 for Graph Neural Networks: Methods, Applications, and Opportunities
Figure 2 for Graph Neural Networks: Methods, Applications, and Opportunities
Figure 3 for Graph Neural Networks: Methods, Applications, and Opportunities
Figure 4 for Graph Neural Networks: Methods, Applications, and Opportunities
Viaarxiv icon

Lightweight Adapter Tuning for Multilingual Speech Translation

Add code
Bookmark button
Alert button
Jun 02, 2021
Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier

Figure 1 for Lightweight Adapter Tuning for Multilingual Speech Translation
Figure 2 for Lightweight Adapter Tuning for Multilingual Speech Translation
Figure 3 for Lightweight Adapter Tuning for Multilingual Speech Translation
Figure 4 for Lightweight Adapter Tuning for Multilingual Speech Translation
Viaarxiv icon

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

Add code
Bookmark button
Alert button
Jan 02, 2021
Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Pino, Emmanuel Dupoux

Figure 1 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 2 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 3 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 4 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Viaarxiv icon

Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions

May 31, 2021
Roman Bedyakin, Nikolay Mikhaylovskiy

Figure 1 for Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions
Figure 2 for Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions
Figure 3 for Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions
Figure 4 for Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions
Viaarxiv icon

Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR

Add code
Bookmark button
Alert button
May 31, 2021
Shammur Absar Chowdhury, Amir Hussein, Ahmed Abdelali, Ahmed Ali

Figure 1 for Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR
Figure 2 for Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR
Figure 3 for Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR
Figure 4 for Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR
Viaarxiv icon

Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge

Sep 21, 2015
Xiaofei Wang, Chao Wu, Pengyuan Zhang, Ziteng Wang, Yong Liu, Xu Li, Qiang Fu, Yonghong Yan

Figure 1 for Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge
Figure 2 for Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge
Figure 3 for Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge
Figure 4 for Noise Robust IOA/CAS Speech Separation and Recognition System For The Third 'CHIME' Challenge
Viaarxiv icon