Alert button

"speech": models, code, and papers
Alert button

What has LeBenchmark Learnt about French Syntax?

Mar 04, 2024
Zdravko Dugonjić, Adrien Pupier, Benjamin Lecouteux, Maximin Coavoux

Figure 1 for What has LeBenchmark Learnt about French Syntax?
Figure 2 for What has LeBenchmark Learnt about French Syntax?
Figure 3 for What has LeBenchmark Learnt about French Syntax?
Figure 4 for What has LeBenchmark Learnt about French Syntax?
Viaarxiv icon

Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models

Mar 04, 2024
Sargam Yadav, Abhishek Kaushik, Kevin McDaid

Figure 1 for Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models
Figure 2 for Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models
Figure 3 for Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models
Figure 4 for Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models
Viaarxiv icon

Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition

Feb 20, 2024
Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra

Viaarxiv icon

SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

Feb 10, 2024
Hsuan-Fu Wang, Yi-Jen Shih, Heng-Jui Chang, Layne Berry, Puyuan Peng, Hung-yi Lee, Hsin-Min Wang, David Harwath

Viaarxiv icon

sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks

Mar 09, 2024
Qu Yang, Qianhui Liu, Nan Li, Meng Ge, Zeyang Song, Haizhou Li

Figure 1 for sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
Figure 2 for sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
Figure 3 for sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
Figure 4 for sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
Viaarxiv icon

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank

Add code
Bookmark button
Alert button
Mar 15, 2024
Verena Blaschke, Barbara Kovačić, Siyao Peng, Hinrich Schütze, Barbara Plank

Figure 1 for MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Figure 2 for MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Figure 3 for MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Figure 4 for MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank
Viaarxiv icon

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

Feb 16, 2024
Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

Viaarxiv icon

Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models

Mar 17, 2024
Mohamed Taher Alrefaie, Nour Eldin Morsy, Nada Samir

Figure 1 for Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models
Figure 2 for Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models
Figure 3 for Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models
Figure 4 for Exploring Tokenization Strategies and Vocabulary Sizes for Enhanced Arabic Language Models
Viaarxiv icon

Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition

Feb 20, 2024
David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos

Viaarxiv icon

MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech

Add code
Bookmark button
Alert button
Feb 14, 2024
Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao

Viaarxiv icon