Alert button

"speech recognition": models, code, and papers
Alert button

Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference

Dec 15, 2023
Bartosz Wójcik, Alessio Devoto, Karol Pustelnik, Pasquale Minervini, Simone Scardapane

Viaarxiv icon

CPPF: A contextual and post-processing-free model for automatic speech recognition

Add code
Bookmark button
Alert button
Sep 21, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

Figure 1 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 2 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 3 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Viaarxiv icon

Efficient Representation of the Activation Space in Deep Neural Networks

Dec 13, 2023
Tanya Akumu, Celia Cintas, Girmaw Abebe Tadesse, Adebayo Oshingbesan, Skyler Speakman, Edward McFowland III

Viaarxiv icon

Augmenting conformers with structured state space models for online speech recognition

Sep 15, 2023
Haozhe Shan, Albert Gu, Zhong Meng, Weiran Wang, Krzysztof Choromanski, Tara Sainath

Figure 1 for Augmenting conformers with structured state space models for online speech recognition
Figure 2 for Augmenting conformers with structured state space models for online speech recognition
Figure 3 for Augmenting conformers with structured state space models for online speech recognition
Figure 4 for Augmenting conformers with structured state space models for online speech recognition
Viaarxiv icon

Are Soft Prompts Good Zero-shot Learners for Speech Recognition?

Sep 18, 2023
Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter-Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma

Figure 1 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Figure 2 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Figure 3 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Figure 4 for Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
Viaarxiv icon

SECap: Speech Emotion Captioning with Large Language Model

Dec 23, 2023
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shixiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu

Viaarxiv icon

Creating Spoken Dialog Systems in Ultra-Low Resourced Settings

Dec 11, 2023
Moayad Elamin, Muhammad Omer, Yonas Chanie, Henslaac Ndlovu

Viaarxiv icon

Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals

Aug 16, 2023
Running Zhao, Jiangtao Yu, Hang Zhao, Edith C. H. Ngai

Figure 1 for Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Figure 2 for Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Figure 3 for Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Figure 4 for Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals
Viaarxiv icon

One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition

Add code
Bookmark button
Alert button
Oct 02, 2023
Samuele Cornell, Jee-weon Jung, Shinji Watanabe, Stefano Squartini

Figure 1 for One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Figure 2 for One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Figure 3 for One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
Viaarxiv icon

Mavericks at NADI 2023 Shared Task: Unravelling Regional Nuances through Dialect Identification using Transformer-based Approach

Nov 30, 2023
Vedant Deshpande, Yash Patwardhan, Kshitij Deshpande, Sudeep Mangalvedhekar, Ravindra Murumkar

Viaarxiv icon