speech


EnvX: Agentize Everything with Agentic AI

Add code
Sep 09, 2025
Viaarxiv icon

Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation

Add code
Sep 09, 2025
Viaarxiv icon

Continuous Audio Language Models

Add code
Sep 09, 2025
Viaarxiv icon

Spectral and Rhythm Feature Performance Evaluation for Category and Class Level Audio Classification with Deep Convolutional Neural Networks

Add code
Sep 09, 2025
Viaarxiv icon

A Bottom-up Framework with Language-universal Speech Attribute Modeling for Syllable-based ASR

Add code
Sep 09, 2025
Viaarxiv icon

AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training

Add code
Sep 09, 2025
Viaarxiv icon

Identifying and Calibrating Overconfidence in Noisy Speech Recognition

Add code
Sep 08, 2025
Viaarxiv icon

On the Same Wavelength? Evaluating Pragmatic Reasoning in Language Models across Broad Concepts

Add code
Sep 08, 2025
Viaarxiv icon

LLM Analysis of 150+ years of German Parliamentary Debates on Migration Reveals Shift from Post-War Solidarity to Anti-Solidarity in the Last Decade

Add code
Sep 08, 2025
Viaarxiv icon

The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties

Add code
Sep 08, 2025
Viaarxiv icon