speech


Assessing the Alignment of Audio Representations with Timbre Similarity Ratings

Jul 10, 2025
Via arXiv

Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review

Jul 10, 2025
Via arXiv

Generic Speech Enhancement with Self-Supervised Representation Space Loss

Jul 10, 2025
Via arXiv

DMF2Mel: A Dynamic Multiscale Fusion Network for EEG-Driven Mel Spectrogram Reconstruction

Jul 10, 2025
Via arXiv

IML-Spikeformer: Input-aware Multi-Level Spiking Transformer for Speech Processing

Jul 10, 2025
Via arXiv

Incremental Averaging Method to Improve Graph-Based Time-Difference-of-Arrival Estimation

Jul 09, 2025
Via arXiv

A Novel Hybrid Deep Learning Technique for Speech Emotion Detection using Feature Engineering

Jul 09, 2025
Via arXiv

Democratizing High-Fidelity Co-Speech Gesture Video Generation

Jul 09, 2025
Via arXiv

Speech Tokenizer is Key to Consistent Representation

Jul 09, 2025
Via arXiv

Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus

Jul 09, 2025
Via arXiv