Alert button

"speech recognition": models, code, and papers
Alert button

Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-temporal Sparsity

Add code
Bookmark button
Alert button
Aug 11, 2021
Chang Gao, Tobi Delbruck, Shih-Chii Liu

Figure 1 for Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-temporal Sparsity
Figure 2 for Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-temporal Sparsity
Figure 3 for Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-temporal Sparsity
Figure 4 for Spartus: A 9.4 TOp/s FPGA-based LSTM Accelerator Exploiting Spatio-temporal Sparsity
Viaarxiv icon

Protecting gender and identity with disentangled speech representations

Apr 22, 2021
Dimitrios Stoidis, Andrea Cavallaro

Figure 1 for Protecting gender and identity with disentangled speech representations
Figure 2 for Protecting gender and identity with disentangled speech representations
Viaarxiv icon

Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens

Jun 23, 2021
Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura

Figure 1 for Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens
Figure 2 for Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens
Figure 3 for Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks using Switching Tokens
Viaarxiv icon

Disfluency Detection with Unlabeled Data and Small BERT Models

Apr 21, 2021
Johann C. Rocholl, Vicky Zayats, Daniel D. Walker, Noah B. Murad, Aaron Schneider, Daniel J. Liebling

Figure 1 for Disfluency Detection with Unlabeled Data and Small BERT Models
Figure 2 for Disfluency Detection with Unlabeled Data and Small BERT Models
Figure 3 for Disfluency Detection with Unlabeled Data and Small BERT Models
Figure 4 for Disfluency Detection with Unlabeled Data and Small BERT Models
Viaarxiv icon

Enhancing the Intelligibility of Cleft Lip and Palate Speech using Cycle-consistent Adversarial Networks

Jan 30, 2021
Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S R Mahadeva Prasanna

Figure 1 for Enhancing the Intelligibility of Cleft Lip and Palate Speech using Cycle-consistent Adversarial Networks
Figure 2 for Enhancing the Intelligibility of Cleft Lip and Palate Speech using Cycle-consistent Adversarial Networks
Figure 3 for Enhancing the Intelligibility of Cleft Lip and Palate Speech using Cycle-consistent Adversarial Networks
Figure 4 for Enhancing the Intelligibility of Cleft Lip and Palate Speech using Cycle-consistent Adversarial Networks
Viaarxiv icon

FedScale: Benchmarking Model and System Performance of Federated Learning

Add code
Bookmark button
Alert button
May 24, 2021
Fan Lai, Yinwei Dai, Xiangfeng Zhu, Mosharaf Chowdhury

Figure 1 for FedScale: Benchmarking Model and System Performance of Federated Learning
Figure 2 for FedScale: Benchmarking Model and System Performance of Federated Learning
Figure 3 for FedScale: Benchmarking Model and System Performance of Federated Learning
Figure 4 for FedScale: Benchmarking Model and System Performance of Federated Learning
Viaarxiv icon

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Add code
Bookmark button
Alert button
Jun 21, 2021
Gaurav Menghani

Figure 1 for Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Figure 2 for Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Figure 3 for Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Figure 4 for Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Viaarxiv icon

Convo: What does conversational programming need? An exploration of machine learning interface design

Mar 03, 2020
Jessica Van Brummelen, Kevin Weng, Phoebe Lin, Catherine Yeo

Figure 1 for Convo: What does conversational programming need? An exploration of machine learning interface design
Figure 2 for Convo: What does conversational programming need? An exploration of machine learning interface design
Figure 3 for Convo: What does conversational programming need? An exploration of machine learning interface design
Figure 4 for Convo: What does conversational programming need? An exploration of machine learning interface design
Viaarxiv icon

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning

Add code
Bookmark button
Alert button
Oct 27, 2020
Dongwei Jiang, Wubo Li, Miao Cao, Ruixiong Zhang, Wei Zou, Kun Han, Xiangang Li

Figure 1 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 2 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 3 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Figure 4 for Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Viaarxiv icon

Multi-task Learning with Cross Attention for Keyword Spotting

Jul 15, 2021
Takuya Higuchi, Anmol Gupta, Chandra Dhir

Figure 1 for Multi-task Learning with Cross Attention for Keyword Spotting
Figure 2 for Multi-task Learning with Cross Attention for Keyword Spotting
Figure 3 for Multi-task Learning with Cross Attention for Keyword Spotting
Figure 4 for Multi-task Learning with Cross Attention for Keyword Spotting
Viaarxiv icon