Alert button

"speech recognition": models, code, and papers
Alert button

Out-of-Distribution Representation Learning for Time Series Classification

Add code
Bookmark button
Alert button
Sep 26, 2022
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xing Xie

Figure 1 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 2 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 3 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 4 for Out-of-Distribution Representation Learning for Time Series Classification
Viaarxiv icon

Deep Graph Random Process for Relational-Thinking-Based Speech Recognition

Jul 04, 2020
Hengguan Huang, Fuzhao Xue, Hao Wang, Ye Wang

Figure 1 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Figure 2 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Figure 3 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Figure 4 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Viaarxiv icon

Distribution Aware Metrics for Conditional Natural Language Generation

Sep 29, 2022
David M Chan, Yiming Ni, David A Ross, Sudheendra Vijayanarasimhan, Austin Myers, John Canny

Figure 1 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 2 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 3 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 4 for Distribution Aware Metrics for Conditional Natural Language Generation
Viaarxiv icon

Model Blending for Text Classification

Aug 05, 2022
Ramit Pahwa

Figure 1 for Model Blending for Text Classification
Figure 2 for Model Blending for Text Classification
Figure 3 for Model Blending for Text Classification
Figure 4 for Model Blending for Text Classification
Viaarxiv icon

Improving End-to-End Speech Recognition with Policy Learning

Dec 19, 2017
Yingbo Zhou, Caiming Xiong, Richard Socher

Figure 1 for Improving End-to-End Speech Recognition with Policy Learning
Figure 2 for Improving End-to-End Speech Recognition with Policy Learning
Figure 3 for Improving End-to-End Speech Recognition with Policy Learning
Figure 4 for Improving End-to-End Speech Recognition with Policy Learning
Viaarxiv icon

Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition

Add code
Bookmark button
Alert button
Feb 28, 2021
Hirofumi Inaguma, Tatsuya Kawahara

Figure 1 for Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Figure 2 for Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Figure 3 for Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Figure 4 for Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
Viaarxiv icon

Exploring Neural Transducers for End-to-End Speech Recognition

Jul 24, 2017
Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu

Figure 1 for Exploring Neural Transducers for End-to-End Speech Recognition
Figure 2 for Exploring Neural Transducers for End-to-End Speech Recognition
Figure 3 for Exploring Neural Transducers for End-to-End Speech Recognition
Figure 4 for Exploring Neural Transducers for End-to-End Speech Recognition
Viaarxiv icon

ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA

Feb 20, 2017
Song Han, Junlong Kang, Huizi Mao, Yiming Hu, Xin Li, Yubin Li, Dongliang Xie, Hong Luo, Song Yao, Yu Wang, Huazhong Yang, William J. Dally

Figure 1 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 2 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 3 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Figure 4 for ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Viaarxiv icon

SNRi Target Training for Joint Speech Enhancement and Recognition

Add code
Bookmark button
Alert button
Nov 01, 2021
Yuma Koizumi, Shigeki Karita, Arun Narayanan, Sankaran Panchapagesan, Michiel Bacchiani

Figure 1 for SNRi Target Training for Joint Speech Enhancement and Recognition
Figure 2 for SNRi Target Training for Joint Speech Enhancement and Recognition
Figure 3 for SNRi Target Training for Joint Speech Enhancement and Recognition
Figure 4 for SNRi Target Training for Joint Speech Enhancement and Recognition
Viaarxiv icon

Achieving Human Parity in Conversational Speech Recognition

Feb 17, 2017
W. Xiong, J. Droppo, X. Huang, F. Seide, M. Seltzer, A. Stolcke, D. Yu, G. Zweig

Figure 1 for Achieving Human Parity in Conversational Speech Recognition
Figure 2 for Achieving Human Parity in Conversational Speech Recognition
Figure 3 for Achieving Human Parity in Conversational Speech Recognition
Figure 4 for Achieving Human Parity in Conversational Speech Recognition
Viaarxiv icon