"speech recognition": models, code, and papers

Optimizing Speech Recognition For The Edge

Sep 26, 2019
Yuan Shangguan, Jian Li, Liang Qiao, Raziel Alvarez, Ian McGraw

ElectrodeNet -- A Deep Learning Based Sound Coding Strategy for Cochlear Implants

May 26, 2023
Enoch Hsin-Ho Huang, Rong Chao, Yu Tsao, Chao-Min Wu

Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition

Oct 19, 2021
Haozhe Chen, Weiming Zhang, Kunlin Liu, Kejiang Chen, Han Fang, Nenghai Yu

EasyASR: A Distributed Machine Learning Platform for End-to-end Automatic Speech Recognition

Sep 14, 2020
Chengyu Wang, Mengli Cheng, Xu Hu, Jun Huang

Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries

May 20, 2021
Sukhdeep S. Sodhi, Ellie Ka-In Chio, Ambarish Jash, Santiago Ontañón, Ajit Apte, Ankit Kumar, Ayooluwakunmi Jeje, Dima Kuzmin, Harry Fung, Heng-Tze Cheng, Jon Effrat, Tarush Bali, Nitin Jindal, Pei Cao, Sarvjeet Singh, Senqiang Zhou, Tameen Khan, Amol Wankhede, Moustafa Alzantot, Allen Wu, Tushar Chandra

A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling

Mar 09, 2022
Yike Zhang, Xiaobing Feng, Yi Liu, Songjun Cao, Long Ma

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Sep 09, 2020
Quan Wang, Ignacio Lopez Moreno, Mert Saglam, Kevin Wilson, Alan Chiao, Renjie Liu, Yanzhang He, Wei Li, Jason Pelecanos, Marily Nika, Alexander Gruenstein

Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition

Mar 12, 2021
Aleksandr Laptev, Andrei Andrusenko, Ivan Podluzhny, Anton Mitrofanov, Ivan Medennikov, Yuri Matveev

Fusing Wav2vec2.0 and BERT into End-to-end Model for Low-resource Speech Recognition

Jan 17, 2021
Cheng Yi, Shiyu Zhou, Bo Xu

Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition

May 14, 2021
Khin Me Me Chit, Laet Laet Lin
