"speech recognition": models, code, and papers

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Dec 20, 2022
Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti

Improving EEG based Continuous Speech Recognition

Nov 24, 2019
Gautam Krishna, Co Tran, Mason Carnahan, Yan Han, Ahmed H Tewfik

Streaming Punctuation for Long-form Dictation with Transformers

Oct 11, 2022
Piyush Behre, Sharman Tan, Padma Varadharajan, Shuangyu Chang

Fine-Grained Grounding for Multimodal Speech Recognition

Oct 05, 2020
Tejas Srinivasan, Ramon Sanabria, Florian Metze, Desmond Elliott

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction

Nov 23, 2022
Kai Shen, Yichong Leng, Xu Tan, Siliang Tang, Yuan Zhang, Wenjie Liu, Edward Lin

Feature Selection Enhancement and Feature Space Visualization for Speech-Based Emotion Recognition

Aug 19, 2022
Sofia Kanwal, Sohail Asghar, Hazrat Ali

Regeneration Learning: A Learning Paradigm for Data Generation

Jan 21, 2023
Xu Tan, Tao Qin, Jiang Bian, Tie-Yan Liu, Yoshua Bengio

VADOI: Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition

Feb 22, 2022
Jinhan Wang, Xiaosu Tong, Jinxi Guo, Di He, Roland Maas

Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition

Mar 27, 2022
Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee

Neuromorphic spintronics accelerated by an unconventional data-driven Thiele equation approach

Jan 26, 2023
Anatole Moureaux, Simon De Wergifosse, Chloé Chopin, Jimmy Weber, Flavio Abreu Araujo
