Alert button

"speech": models, code, and papers
Alert button

Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition

Jan 28, 2024
Ahnaf Mozib Samin

Viaarxiv icon

Exploring the Power of Pure Attention Mechanisms in Blind Room Parameter Estimation

Feb 25, 2024
Chunxi Wang, Maoshen Jia, Meiran Li, Changchun Bao, Wenyu Jin

Viaarxiv icon

Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline

Jan 26, 2024
Seonmin Koo, Chanjun Park, Jinsung Kim, Jaehyung Seo, Sugyeong Eo, Hyeonseok Moon, Heuiseok Lim

Viaarxiv icon

Aria Everyday Activities Dataset

Add code
Bookmark button
Alert button
Feb 22, 2024
Zhaoyang Lv, Nicholas Charron, Pierre Moulon, Alexander Gamino, Cheng Peng, Chris Sweeney, Edward Miller, Huixuan Tang, Jeff Meissner, Jing Dong, Kiran Somasundaram, Luis Pesqueira, Mark Schwesinger, Omkar Parkhi, Qiao Gu, Renzo De Nardi, Shangyi Cheng, Steve Saarinen, Vijay Baiyya, Yuyang Zou, Richard Newcombe, Jakob Julian Engel, Xiaqing Pan, Carl Ren

Viaarxiv icon

Listening to Multi-talker Conversations: Modular and End-to-end Perspectives

Feb 14, 2024
Desh Raj

Viaarxiv icon

Two-pass Endpoint Detection for Speech Recognition

Jan 17, 2024
Anirudh Raju, Aparna Khare, Di He, Ilya Sklyar, Long Chen, Sam Alptekin, Viet Anh Trinh, Zhe Zhang, Colin Vaz, Venkatesh Ravichandran, Roland Maas, Ariya Rastrow

Viaarxiv icon

Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews

Feb 21, 2024
Hoang-Quynh Le, Duy-Cat Can, Khanh-Vinh Nguyen, Mai-Vu Tran

Viaarxiv icon

The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese

Feb 12, 2024
Ajinkya Kulkarni, Anna Tokareva, Rameez Qureshi, Miguel Couceiro

Viaarxiv icon

Analysis and Detection of Multilingual Hate Speech Using Transformer Based Deep Learning

Jan 19, 2024
Arijit Das, Somashree Nandy, Rupam Saha, Srijan Das, Diganta Saha

Viaarxiv icon

CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations

Add code
Bookmark button
Alert button
Feb 22, 2024
Samraj Moorjani, Adit Krishnan, Hari Sundaram

Viaarxiv icon