Picture for Haizhou Li

Haizhou Li

TP-Spikformer: Token Pruned Spiking Transformer

Add code
Feb 28, 2026
Viaarxiv icon

Discourse-Aware Dual-Track Streaming Response for Low-Latency Spoken Dialogue Systems

Add code
Feb 26, 2026
Viaarxiv icon

Robust Spiking Neural Networks Against Adversarial Attacks

Add code
Feb 24, 2026
Viaarxiv icon

CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data

Add code
Feb 22, 2026
Viaarxiv icon

AudioRAG: A Challenging Benchmark for Audio Reasoning and Information Retrieval

Add code
Feb 11, 2026
Viaarxiv icon

Detect, Attend and Extract: Keyword Guided Target Speaker Extraction

Add code
Feb 08, 2026
Viaarxiv icon

EmoShift: Lightweight Activation Steering for Enhanced Emotion-Aware Speech Synthesis

Add code
Jan 30, 2026
Viaarxiv icon

CATCH: A Controllable Theme Detection Framework with Contextualized Clustering and Hierarchical Generation

Add code
Dec 25, 2025
Viaarxiv icon

ELEGANCE: Efficient LLM Guidance for Audio-Visual Target Speech Extraction

Add code
Nov 09, 2025
Viaarxiv icon

EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models

Add code
Oct 26, 2025
Viaarxiv icon