speech


The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences

Add code
May 06, 2026
Viaarxiv icon

A Comparative Study of PyCaret AutoML and CNN-BiLSTM for Binary Hate Speech Detection in Indonesian Twitter

Add code
May 06, 2026
Viaarxiv icon

Spatial-Magnifier: Spatial upsampling for multichannel speech enhancement

Add code
May 06, 2026
Viaarxiv icon

TajikNLP: An Open-Source Toolkit for Comprehensive Text Processing of Tajik (Cyrillic Script)

Add code
May 06, 2026
Viaarxiv icon

Benchmarking POS Tagging for the Tajik Language: A Comparative Study of Neural Architectures on the TajPersParallel Corpus

Add code
May 06, 2026
Viaarxiv icon

JASTIN: Aligning LLMs for Zero-Shot Audio and Speech Evaluation via Natural Language Instructions

Add code
May 06, 2026
Viaarxiv icon

Audio-Visual Intelligence in Large Foundation Models

Add code
May 05, 2026
Viaarxiv icon

MiniMind-O Technical Report: An Open Small-Scale Speech-Native Omni Model

Add code
May 05, 2026
Viaarxiv icon

A Paradigm for Interpreting Metrics and Identifying Critical Errors in Automatic Speech Recognition

Add code
May 05, 2026
Viaarxiv icon

Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs

Add code
May 05, 2026
Viaarxiv icon