speech


Characterizing Mamba's Selective Memory using Auto-Encoders

Add code
Dec 17, 2025
Figure 1 for Characterizing Mamba's Selective Memory using Auto-Encoders
Figure 2 for Characterizing Mamba's Selective Memory using Auto-Encoders
Figure 3 for Characterizing Mamba's Selective Memory using Auto-Encoders
Figure 4 for Characterizing Mamba's Selective Memory using Auto-Encoders
Viaarxiv icon

Chorus: Harmonizing Context and Sensing Signals for Data-Free Model Customization in IoT

Add code
Dec 17, 2025
Viaarxiv icon

The Moralization Corpus: Frame-Based Annotation and Analysis of Moralizing Speech Acts across Diverse Text Genres

Add code
Dec 17, 2025
Viaarxiv icon

DASH: Dialogue-Aware Similarity and Handshake Recognition for Topic Segmentation in Public-Channel Conversations

Add code
Dec 17, 2025
Viaarxiv icon

From Minutes to Days: Scaling Intracranial Speech Decoding with Supervised Pretraining

Add code
Dec 17, 2025
Viaarxiv icon

Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics

Add code
Dec 17, 2025
Figure 1 for Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics
Figure 2 for Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics
Figure 3 for Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics
Figure 4 for Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics
Viaarxiv icon

On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation

Add code
Dec 17, 2025
Figure 1 for On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation
Figure 2 for On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation
Figure 3 for On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation
Figure 4 for On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation
Viaarxiv icon

From Signal to Turn: Interactional Friction in Modular Speech-to-Speech Pipelines

Add code
Dec 17, 2025
Viaarxiv icon

ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body

Add code
Dec 16, 2025
Viaarxiv icon

Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting

Add code
Dec 16, 2025
Viaarxiv icon