Alert button

"speech recognition": models, code, and papers
Alert button

BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition

Oct 04, 2023
Peikun Chen, Fan Yu, Yuhao Lian, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie

Figure 1 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Figure 2 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Figure 3 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Figure 4 for BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition
Viaarxiv icon

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models

Add code
Bookmark button
Alert button
Oct 16, 2023
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Macro Siniscalchi, Pin-Yu Chen, Eng Siong Chng

Figure 1 for HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Figure 2 for HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Figure 3 for HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Figure 4 for HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Viaarxiv icon

Batched Low-Rank Adaptation of Foundation Models

Dec 09, 2023
Yeming Wen, Swarat Chaudhuri

Figure 1 for Batched Low-Rank Adaptation of Foundation Models
Figure 2 for Batched Low-Rank Adaptation of Foundation Models
Figure 3 for Batched Low-Rank Adaptation of Foundation Models
Figure 4 for Batched Low-Rank Adaptation of Foundation Models
Viaarxiv icon

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Sep 26, 2023
Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastow, Ivan Bulyko

Figure 1 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 2 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 3 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Figure 4 for Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition
Viaarxiv icon

Acoustic characterization of speech rhythm: going beyond metrics with recurrent neural networks

Jan 22, 2024
François Deloche, Laurent Bonnasse-Gahot, Judit Gervain

Viaarxiv icon

Optimizing Two-Pass Cross-Lingual Transfer Learning: Phoneme Recognition and Phoneme to Grapheme Translation

Dec 06, 2023
Wonjun Lee, Gary Geunbae Lee, Yunsu Kim

Viaarxiv icon

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Sep 20, 2023
Tiantian Feng, Ju Lin, Yiteng Huang, Weipeng He, Kaustubh Kalgaonkar, Niko Moritz, Li Wan, Xin Lei, Ming Sun, Frank Seide

Figure 1 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 2 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 3 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 4 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Viaarxiv icon

Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition

Oct 17, 2023
Shahram Ghorbani, John H. L. Hansen

Viaarxiv icon

Graph Convolutions Enrich the Self-Attention in Transformers!

Dec 07, 2023
Jeongwhan Choi, Hyowon Wi, Jayoung Kim, Yehjin Shin, Kookjin Lee, Nathaniel Trask, Noseong Park

Viaarxiv icon

A Review of Hybrid and Ensemble in Deep Learning for Natural Language Processing

Dec 09, 2023
Jianguo Jia, Wen Liang, Youzhi Liang

Viaarxiv icon