"speech": models, code, and papers

READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises

Feb 14, 2023
Chenglei Si, Zhengyan Zhang, Yingfa Chen, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun

Towards Parametric Speech Synthesis Using Gaussian-Markov Model of Spectral Envelope and Wavelet-Based Decomposition of F0

Aug 15, 2022
Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Csaba Zainkó, Géza Németh

TimelyFL: Heterogeneity-aware Asynchronous Federated Learning with Adaptive Partial Training

Apr 14, 2023
Tuo Zhang, Lei Gao, Sunwoo Lee, Mi Zhang, Salman Avestimehr

Privacy against Real-Time Speech Emotion Detection via Acoustic Adversarial Evasion of Machine Learning

Nov 17, 2022
Brian Testa, Yi Xiao, Avery Gump, Asif Salekin

Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals

Jun 22, 2022
Running Zhao, Jiangtao Yu, Tingle Li, Hang Zhao, Edith C. H. Ngai

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

Nov 02, 2022
Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang

Active Learning of Non-semantic Speech Tasks with Pretrained Models

Nov 03, 2022
Harlin Lee, Aaqib Saeed, Andrea L. Bertozzi

UzbekTagger: The rule-based POS tagger for Uzbek language

Jan 30, 2023
Maksud Sharipov, Elmurod Kuriyozov, Ollabergan Yuldashev, Ogabek Sobirov

Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification

Mar 27, 2023
Chunpu Xu, Jing Li

X-TIME: An in-memory engine for accelerating machine learning on tabular data with CAMs

Apr 05, 2023
Giacomo Pedretti, John Moon, Pedro Bruel, Sergey Serebryakov, Ron M. Roth, Luca Buonanno, Tobias Ziegler, Cong Xu, Martin Foltin, Paolo Faraboschi, Jim Ignowski, Catherine E. Graves
