"speech recognition": models, code, and papers

CopyNE: Better Contextual ASR by Copying Named Entities

May 22, 2023
Shilin Zhou, Zhenghua Li, Yu Hong, Min Zhang, Zhefeng Wang, Baoxing Huai


Modular Domain Adaptation for Conformer-Based Streaming ASR

May 22, 2023
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro M. Mengibar


Multimodal Audio-textual Architecture for Robust Spoken Language Understanding

Jun 13, 2023
Anderson R. Avila, Mehdi Rezagholizadeh, Chao Xing


Sub-8-bit quantization for on-device speech recognition: a regularization-free approach

Oct 17, 2022
Kai Zhen, Martin Radfar, Hieu Duy Nguyen, Grant P. Strimel, Nathan Susanj, Athanasios Mouchtaris


Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents

Apr 03, 2022
Priyank Dubey, Bilal Shah


ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale

Jul 22, 2022
Gopinath Chennupati, Milind Rao, Gurpreet Chadha, Aaron Eakin, Anirudh Raju, Gautam Tiwari, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo, Andy Oberlin, Buddha Nandanoor, Prahalad Venkataramanan, Zheng Wu, Pankaj Sitpure


ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

May 18, 2023
Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe


Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction

May 21, 2023
Mohan Shi, Yuchun Shu, Lingyun Zuo, Qian Chen, Shiliang Zhang, Jie Zhang, Li-Rong Dai


Hybrid-SD ($\text{H}_{\text{SD}}$) : A new hybrid evaluation metric for automatic speech recognition tasks

Nov 03, 2022
Zitha Sasindran, Harsha Yelchuri, Supreeth Rao, T. V. Prabhakar


Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

Feb 15, 2022
Zi-Qiang Zhang, Jie Zhang, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai
