Alert button

"speech": models, code, and papers
Alert button

Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization

Oct 14, 2022
Manuele Rusci, Marco Fariselli, Martin Croome, Francesco Paci, Eric Flamand

Figure 1 for Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization
Figure 2 for Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization
Figure 3 for Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization
Figure 4 for Accelerating RNN-based Speech Enhancement on a Multi-Core MCU with Mixed FP16-INT8 Post-Training Quantization
Viaarxiv icon

Quantifying How Hateful Communities Radicalize Online Users

Sep 19, 2022
Matheus Schmitz, Keith Burghardt, Goran Muric

Figure 1 for Quantifying How Hateful Communities Radicalize Online Users
Figure 2 for Quantifying How Hateful Communities Radicalize Online Users
Figure 3 for Quantifying How Hateful Communities Radicalize Online Users
Figure 4 for Quantifying How Hateful Communities Radicalize Online Users
Viaarxiv icon

Speech Toxicity Analysis: A New Spoken Language Processing Task

Add code
Bookmark button
Alert button
Nov 06, 2021
Sreyan Ghosh, Samden Lepcha, S Sakshi, Rajiv Ratn Shah

Figure 1 for Speech Toxicity Analysis: A New Spoken Language Processing Task
Figure 2 for Speech Toxicity Analysis: A New Spoken Language Processing Task
Figure 3 for Speech Toxicity Analysis: A New Spoken Language Processing Task
Figure 4 for Speech Toxicity Analysis: A New Spoken Language Processing Task
Viaarxiv icon

Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition

Feb 07, 2022
Bethan Thomas, Samuel Kessler, Salah Karout

Figure 1 for Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition
Figure 2 for Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition
Figure 3 for Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition
Figure 4 for Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition
Viaarxiv icon

Multitask Learning for Low Resource Spoken Language Understanding

Nov 24, 2022
Quentin Meeus, Marie-Francine Moens, Hugo Van hamme

Figure 1 for Multitask Learning for Low Resource Spoken Language Understanding
Figure 2 for Multitask Learning for Low Resource Spoken Language Understanding
Figure 3 for Multitask Learning for Low Resource Spoken Language Understanding
Figure 4 for Multitask Learning for Low Resource Spoken Language Understanding
Viaarxiv icon

Adaptive multilingual speech recognition with pretrained models

Add code
Bookmark button
Alert button
May 24, 2022
Ngoc-Quan Pham, Alex Waibel, Jan Niehues

Figure 1 for Adaptive multilingual speech recognition with pretrained models
Viaarxiv icon

Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design

Feb 11, 2023
Lyle Regenwetter, Akash Srivastava, Dan Gutfreund, Faez Ahmed

Figure 1 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Figure 2 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Figure 3 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Figure 4 for Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design
Viaarxiv icon

MTTM: Metamorphic Testing for Textual Content Moderation Software

Add code
Bookmark button
Alert button
Feb 11, 2023
Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael Lyu

Figure 1 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 2 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 3 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 4 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Viaarxiv icon

Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification

Mar 30, 2022
Yikang Wang, Hiromitsu Nishizaki

Figure 1 for Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification
Figure 2 for Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification
Figure 3 for Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification
Figure 4 for Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification
Viaarxiv icon

Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction

Add code
Bookmark button
Alert button
Apr 05, 2022
Helard Becerra, Alessandro Ragano, Andrew Hines

Figure 1 for Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction
Figure 2 for Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction
Figure 3 for Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction
Viaarxiv icon