Alert button

"speech recognition": models, code, and papers
Alert button

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jan 04, 2024
David M. Chan, Shalini Ghosh, Hitesh Tulsiani, Ariya Rastrow, Björn Hoffmeister

Viaarxiv icon

Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search

Jan 19, 2024
Yui Sudo, Muhammad Shakeel, Yosuke Fukumoto, Yifan Peng, Shinji Watanabe

Viaarxiv icon

Listening to Multi-talker Conversations: Modular and End-to-end Perspectives

Feb 14, 2024
Desh Raj

Viaarxiv icon

Continuously Learning New Words in Automatic Speech Recognition

Jan 09, 2024
Christian Huber, Alexander Waibel

Viaarxiv icon

Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models

Jan 03, 2024
Rita Frieske, Bertram E. Shi

Figure 1 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Figure 2 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Figure 3 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Figure 4 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Viaarxiv icon

Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

Feb 22, 2024
Rui Zhou, Xian Li, Ying Fang, Xiaofei Li

Viaarxiv icon

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition

Add code
Bookmark button
Alert button
Jan 19, 2024
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng

Viaarxiv icon

Introduction to speech recognition

Feb 01, 2024
Gabriel Dauphin

Viaarxiv icon

Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking

Mar 13, 2024
Ming Dong, Yujing Chen, Miao Zhang, Hao Sun, Tingting He

Figure 1 for Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
Figure 2 for Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
Figure 3 for Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
Figure 4 for Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
Viaarxiv icon

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing

Add code
Bookmark button
Alert button
Feb 23, 2024
Jeong Hun Yeo, Seunghee Han, Minsu Kim, Yong Man Ro

Viaarxiv icon