"speech recognition": models, code, and papers

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Sep 26, 2023
Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko

Via arXiv

A Review of Hybrid and Ensemble in Deep Learning for Natural Language Processing

Dec 09, 2023
Jianguo Jia, Wen Liang, Youzhi Liang

Via arXiv

Large Language Models for Autonomous Driving: Real-World Experiments

Dec 14, 2023
Can Cui, Zichong Yang, Yupeng Zhou, Yunsheng Ma, Juanwu Lu, Ziran Wang

Via arXiv

Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition

Oct 17, 2023
Shahram Ghorbani, John H. L. Hansen

Via arXiv

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Sep 20, 2023
Tiantian Feng, Ju Lin, Yiteng Huang, Weipeng He, Kaustubh Kalgaonkar, Niko Moritz, Li Wan, Xin Lei, Ming Sun, Frank Seide

Via arXiv

Collaborative Learning with Artificial Intelligence Speakers (CLAIS): Pre-Service Elementary Science Teachers' Responses to the Prototype

Dec 20, 2023
Gyeong-Geon Lee, Seonyeong Mun, Myeong-Kyeong Shin, Xiaoming Zhai

Via arXiv

LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild

Nov 21, 2023
David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos

Via arXiv

CPPF: A contextual and post-processing-free model for automatic speech recognition

Sep 21, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan

Via arXiv

Partial Rewriting for Multi-Stage ASR

Dec 08, 2023
Antoine Bruguier, David Qiu, Yanzhang He

Via arXiv

A Multi-Task, Multi-Modal Approach for Predicting Categorical and Dimensional Emotions

Dec 31, 2023
Alex-Răzvan Ispas, Théo Deschamps-Berger, Laurence Devillers

Via arXiv