Alert button

"speech recognition": models, code, and papers
Alert button

StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR

Add code
Bookmark button
Alert button
Jul 15, 2021
Hirofumi Inaguma, Tatsuya Kawahara

Figure 1 for StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR
Figure 2 for StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR
Figure 3 for StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR
Figure 4 for StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR
Viaarxiv icon

Game of Gradients: Mitigating Irrelevant Clients in Federated Learning

Add code
Bookmark button
Alert button
Oct 23, 2021
Lokesh Nagalapatti, Ramasuri Narayanam

Figure 1 for Game of Gradients: Mitigating Irrelevant Clients in Federated Learning
Figure 2 for Game of Gradients: Mitigating Irrelevant Clients in Federated Learning
Figure 3 for Game of Gradients: Mitigating Irrelevant Clients in Federated Learning
Figure 4 for Game of Gradients: Mitigating Irrelevant Clients in Federated Learning
Viaarxiv icon

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling

Add code
Bookmark button
Alert button
Jun 15, 2021
Ashish Shenoy, Sravan Bodapati, Katrin Kirchhoff

Figure 1 for ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Figure 2 for ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Figure 3 for ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Figure 4 for ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling
Viaarxiv icon

A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech

Jun 15, 2021
Pu Wang, Bagher BabaAli, Hugo Van hamme

Figure 1 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Figure 2 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Figure 3 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Figure 4 for A Study into Pre-training Strategies for Spoken Language Understanding on Dysarthric Speech
Viaarxiv icon

Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation

May 14, 2021
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir

Figure 1 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 2 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 3 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Figure 4 for Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Viaarxiv icon

Data augmentation using prosody and false starts to recognize non-native children's speech

Add code
Bookmark button
Alert button
Aug 29, 2020
Hemant Kathania, Mittul Singh, Tamás Grósz, Mikko Kurimo

Figure 1 for Data augmentation using prosody and false starts to recognize non-native children's speech
Figure 2 for Data augmentation using prosody and false starts to recognize non-native children's speech
Figure 3 for Data augmentation using prosody and false starts to recognize non-native children's speech
Figure 4 for Data augmentation using prosody and false starts to recognize non-native children's speech
Viaarxiv icon

Low-activity supervised convolutional spiking neural networks applied to speech commands recognition

Add code
Bookmark button
Alert button
Nov 13, 2020
Thomas Pellegrini, Romain Zimmer, Timothée Masquelier

Figure 1 for Low-activity supervised convolutional spiking neural networks applied to speech commands recognition
Figure 2 for Low-activity supervised convolutional spiking neural networks applied to speech commands recognition
Figure 3 for Low-activity supervised convolutional spiking neural networks applied to speech commands recognition
Figure 4 for Low-activity supervised convolutional spiking neural networks applied to speech commands recognition
Viaarxiv icon

Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches

Jul 07, 2022
Tusarkanta Dalai, Tapas Kumar Mishra, Pankaj K Sa

Figure 1 for Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches
Figure 2 for Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches
Figure 3 for Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches
Figure 4 for Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches
Viaarxiv icon

Quantization of Deep Neural Networks for Accurate EdgeComputing

Apr 25, 2021
Wentao Chen, Hailong Qiu, Jian Zhuang, Chutong Zhang, Yu Hu, Qing Lu, Tianchen Wang, Yiyu Shi†, Meiping Huang, Xiaowe Xu

Figure 1 for Quantization of Deep Neural Networks for Accurate EdgeComputing
Figure 2 for Quantization of Deep Neural Networks for Accurate EdgeComputing
Figure 3 for Quantization of Deep Neural Networks for Accurate EdgeComputing
Figure 4 for Quantization of Deep Neural Networks for Accurate EdgeComputing
Viaarxiv icon

Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation

Aug 04, 2021
Seongmin Park, Dongchan Shin, Sangyoun Paik, Subong Choi, Alena Kazakova, Jihwa Lee

Figure 1 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Figure 2 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Figure 3 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Figure 4 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Viaarxiv icon