"speech": models, code, and papers

Sparse Modular Activation for Efficient Sequence Modeling

Jun 19, 2023
Liliang Ren, Yang Liu, Shuohang Wang, Yichong Xu, Chenguang Zhu, ChengXiang Zhai

Analysis of Noisy-target Training for DNN-based speech enhancement

Nov 02, 2022
Takuya Fujimura, Tomoki Toda

Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices

Apr 22, 2023
Ahlam Husni Abu Nada, Siddique Latif, Junaid Qadir

DDSupport: Language Learning Support System that Displays Differences and Distances from Model Speech

Dec 08, 2022
Kazuki Kawamura, Jun Rekimoto

Exploring Effective Fusion Algorithms for Speech Based Self-Supervised Learning Models

Dec 20, 2022
Changli Tang, Yujin Wang, Xie Chen, Wei-Qiang Zhang

Efficient Speech Translation with Dynamic Latent Perceivers

Oct 28, 2022
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussá

Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings

Nov 12, 2022
Karl El Hajal, Zihan Wu, Neil Scheidwasser-Clow, Gasser Elbanna, Milos Cernak

Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition

Nov 22, 2022
Injy Hamed, Amir Hussein, Oumnia Chellah, Shammur Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali

Robust Speech Recognition via Large-Scale Weak Supervision

Dec 06, 2022
Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever

Model Extraction Attack against Self-supervised Speech Models

Nov 29, 2022
Tsu-Yuan Hsu, Chen-An Li, Tung-Yu Wu, Hung-yi Lee
