speech


Mind-Paced Speaking: A Dual-Brain Approach to Real-Time Reasoning in Spoken Language Models

Oct 10, 2025

VoiceAgentBench: Are Voice Assistants ready for agentic tasks?

Oct 09, 2025

CS3-Bench: Evaluating and Enhancing Speech-to-Speech LLMs for Mandarin-English Code-Switching

Oct 09, 2025

Bloodroot: When Watermarking Turns Poisonous For Stealthy Backdoor

Oct 09, 2025

Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation

Oct 09, 2025

Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition

Oct 09, 2025

Standard-to-Dialect Transfer Trends Differ across Text and Speech: A Case Study on Intent and Topic Classification in German Dialects

Oct 09, 2025

Full-Duplex-Bench-v2: A Multi-Turn Evaluation Framework for Duplex Dialogue Systems with an Automated Examiner

Oct 09, 2025

Causality Guided Representation Learning for Cross-Style Hate Speech Detection

Oct 09, 2025

IsoSignVid2Aud: Sign Language Video to Audio Conversion without Text Intermediaries

Oct 09, 2025