Gopala Anumanchipalli

StyleStream: Real-Time Zero-Shot Voice Style Conversion

Feb 23, 2026
Via arXiv

Conversational Behavior Modeling Foundation Model With Multi-Level Perception

Feb 11, 2026
Via arXiv

Asymmetric Hierarchical Anchoring for Audio-Visual Joint Representation: Resolving Information Allocation Ambiguity for Robust Cross-Modal Generalization

Feb 03, 2026
Via arXiv

HuPER: A Human-Inspired Framework for Phonetic Perception

Feb 02, 2026
Via arXiv

Evolutionary Strategies lead to Catastrophic Forgetting in LLMs

Jan 28, 2026
Via arXiv

Enabling Conversational Behavior Reasoning Capabilities in Full-Duplex Speech

Dec 25, 2025
Via arXiv

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Dec 14, 2025
Via arXiv

How Do LLMs Use Their Depth?

Oct 21, 2025
Via arXiv

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

Aug 25, 2025
Via arXiv

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Aug 05, 2025
Via arXiv