speech


Acoustivision Pro: An Open-Source Interactive Platform for Room Impulse Response Analysis and Acoustic Characterization

Add code
Feb 11, 2026
Viaarxiv icon

SCRAPL: Scattering Transform with Random Paths for Machine Learning

Add code
Feb 11, 2026
Viaarxiv icon

From Diet to Free Lunch: Estimating Auxiliary Signal Properties using Dynamic Pruning Masks in Speech Enhancement Networks

Add code
Feb 11, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

AudioRAG: A Challenging Benchmark for Audio Reasoning and Information Retrieval

Add code
Feb 11, 2026
Viaarxiv icon

MerkleSpeech: Public-Key Verifiable, Chunk-Localised Speech Provenance via Perceptual Fingerprints and Merkle Commitments

Add code
Feb 10, 2026
Viaarxiv icon

Towards Training-free Multimodal Hate Localisation with Large Language Models

Add code
Feb 10, 2026
Viaarxiv icon

TVTSyn: Content-Synchronous Time-Varying Timbre for Streaming Voice Conversion and Anonymization

Add code
Feb 10, 2026
Viaarxiv icon

ViSpeechFormer: A Phonemic Approach for Vietnamese Automatic Speech Recognition

Add code
Feb 10, 2026
Viaarxiv icon

BioME: A Resource-Efficient Bioacoustic Foundational Model for IoT Applications

Add code
Feb 10, 2026
Viaarxiv icon