Alert button

"speech": models, code, and papers
Alert button

IruMozhi: Automatically classifying diglossia in Tamil

Nov 13, 2023
Kabilan Prasanna, Aryaman Arora

Viaarxiv icon

Modelling prospective memory and resilient situated communications via Wizard of Oz

Nov 09, 2023
Yanzhe Li, Frank Broz, Mark Neerincx

Viaarxiv icon

Speech-dependent Modeling of Own Voice Transfer Characteristics for In-ear Microphones in Hearables

Sep 15, 2023
Mattes Ohlenbusch, Christian Rollwage, Simon Doclo

Viaarxiv icon

Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

Nov 15, 2023
Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

Viaarxiv icon

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

Nov 19, 2023
Xuxin Cheng, Bowen Cao, Qichen Ye, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Viaarxiv icon

RoDia: A New Dataset for Romanian Dialect Identification from Speech

Sep 12, 2023
Codrut Rotaru, Nicolae-Catalin Ristea, Radu Tudor Ionescu

Figure 1 for RoDia: A New Dataset for Romanian Dialect Identification from Speech
Figure 2 for RoDia: A New Dataset for Romanian Dialect Identification from Speech
Figure 3 for RoDia: A New Dataset for Romanian Dialect Identification from Speech
Figure 4 for RoDia: A New Dataset for Romanian Dialect Identification from Speech
Viaarxiv icon

Investigating the use of publicly available natural videos to learn Dynamic MR image reconstruction

Nov 23, 2023
Olivier Jaubert, Michele Pascale, Javier Montalt-Tordera, Julius Akesson, Ruta Virsinskaite, Daniel Knight, Simon Arridge, Jennifer Steeden, Vivek Muthurangu

Viaarxiv icon

GRASS: Unified Generation Model for Speech Semantic Understanding

Sep 06, 2023
Aobo Xia, Shuyu Lei, Yushu Yang, Xiang Guo, Hua Chai

Figure 1 for GRASS: Unified Generation Model for Speech Semantic Understanding
Figure 2 for GRASS: Unified Generation Model for Speech Semantic Understanding
Figure 3 for GRASS: Unified Generation Model for Speech Semantic Understanding
Viaarxiv icon

Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter

Sep 19, 2023
Song Li, Yongbin You, Xuezhi Wang, Ke Ding, Guanglu Wan

Figure 1 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 2 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 3 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 4 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Viaarxiv icon

Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services

Oct 06, 2023
Dasol Choi, Jooyoung Song, Eunsun Lee, Jinwoo Seo, Heejune Park, Dongbin Na

Figure 1 for Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services
Figure 2 for Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services
Figure 3 for Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services
Figure 4 for Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services
Viaarxiv icon