Picture for Gopala Anumanchipalli

Gopala Anumanchipalli

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Add code
Aug 25, 2025
Viaarxiv icon

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Add code
Aug 05, 2025
Viaarxiv icon

MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real

Add code
Jul 03, 2025
Viaarxiv icon

RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding

Add code
Jun 12, 2025
Viaarxiv icon

Efficient Knowledge Editing via Minimal Precomputation

Add code
Jun 04, 2025
Viaarxiv icon

Sounding that Object: Interactive Object-Aware Image to Audio Generation

Add code
Jun 04, 2025
Viaarxiv icon

Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection

Add code
May 28, 2025
Viaarxiv icon

Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection

Add code
May 22, 2025
Viaarxiv icon

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks

Add code
Mar 12, 2025
Viaarxiv icon

Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

Add code
Mar 06, 2025
Viaarxiv icon