speech


Chorus: Harmonizing Context and Sensing Signals for Data-Free Model Customization in IoT

Dec 17, 2025

The Moralization Corpus: Frame-Based Annotation and Analysis of Moralizing Speech Acts across Diverse Text Genres

Dec 17, 2025

DASH: Dialogue-Aware Similarity and Handshake Recognition for Topic Segmentation in Public-Channel Conversations

Dec 17, 2025

From Minutes to Days: Scaling Intracranial Speech Decoding with Supervised Pretraining

Dec 17, 2025

Towards Seamless Interaction: Causal Turn-Level Modeling of Interactive 3D Conversational Head Dynamics

Dec 17, 2025

On the Use of Self-Supervised Representation Learning for Speaker Diarization and Separation

Dec 17, 2025

From Signal to Turn: Interactional Friction in Modular Speech-to-Speech Pipelines

Dec 17, 2025

ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body

Dec 16, 2025

Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting

Dec 16, 2025

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction

Dec 16, 2025