Picture for Siddhant Arora

Siddhant Arora

Full-Duplex-Bench-v2: A Multi-Turn Evaluation Framework for Duplex Dialogue Systems with an Automated Examiner

Add code
Oct 09, 2025
Viaarxiv icon

Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage

Add code
Oct 02, 2025
Viaarxiv icon

Chain-of-Thought Reasoning in Streaming Full-Duplex End-to-End Spoken Dialogue Systems

Add code
Oct 02, 2025
Viaarxiv icon

Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs

Add code
Jun 12, 2025
Viaarxiv icon

ARECHO: Autoregressive Evaluation via Chain-Based Hypothesis Optimization for Speech Multi-Metric Estimation

Add code
May 30, 2025
Viaarxiv icon

BLAB: Brutally Long Audio Bench

Add code
May 05, 2025
Viaarxiv icon

On The Landscape of Spoken Language Models: A Comprehensive Survey

Add code
Apr 11, 2025
Viaarxiv icon

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems

Add code
Mar 11, 2025
Figure 1 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 2 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 3 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Figure 4 for ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
Viaarxiv icon

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics

Add code
Mar 03, 2025
Figure 1 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Figure 2 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Figure 3 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Figure 4 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Viaarxiv icon

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Add code
Feb 21, 2025
Figure 1 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Figure 2 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Figure 3 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Figure 4 for ESPnet-SpeechLM: An Open Speech Language Model Toolkit
Viaarxiv icon