"speech": models, code, and papers

RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer

Jun 09, 2021
Xingshan Zeng, Liangyou Li, Qun Liu

Speaker activity driven neural speech extraction

Feb 09, 2021
Marc Delcroix, Katerina Zmolikova, Tsubasa Ochiai, Keisuke Kinoshita, Tomohiro Nakatani

Speech2Slot: An End-to-End Knowledge-based Slot Filling from Speech

May 10, 2021
Pengwei Wang, Xin Ye, Xiaohuan Zhou, Jinghui Xie, Hao Wang

Joint magnitude estimation and phase recovery using Cycle-in-cycle GAN for non-parallel speech enhancement

Sep 26, 2021
Guochen Yu, Andong Li, Yutian Wang, Yinuo Guo, Chengshi Zheng, Hui Wang

Cross-lingual Hate Speech Detection using Transformer Models

Nov 01, 2021
Teodor Tiţa, Arkaitz Zubiaga

Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition

Apr 19, 2022
Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn Schuller

Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR

Oct 11, 2022
Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman

ConvRNN-T: Convolutional Augmented Recurrent Neural Network Transducers for Streaming Speech Recognition

Sep 29, 2022
Martin Radfar, Rohit Barnwal, Rupak Vignesh Swaminathan, Feng-Ju Chang, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris

Continual-wav2vec2: an Application of Continual Learning for Self-Supervised Automatic Speech Recognition

Jul 26, 2021
Samuel Kessler, Bethan Thomas, Salah Karout

A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings

Apr 01, 2022
Fan Yu, Zhihao Du, Shiliang Zhang, Yuxiao Lin, Lei Xie
