Picture for Haoxu Wang

Haoxu Wang

SageBwd: A Trainable Low-bit Attention

Add code
Mar 02, 2026
Viaarxiv icon

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Add code
Feb 13, 2026
Viaarxiv icon

LuSeeL: Language-queried Binaural Universal Sound Event Extraction and Localization

Add code
Jan 27, 2026
Viaarxiv icon

E2E-AEC: Implementing an end-to-end neural network learning approach for acoustic echo cancellation

Add code
Jan 23, 2026
Viaarxiv icon

FlowSE-GRPO: Training Flow Matching Speech Enhancement via Online Reinforcement Learning

Add code
Jan 23, 2026
Viaarxiv icon

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Add code
Dec 18, 2025
Viaarxiv icon

MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis

Add code
Sep 18, 2025
Viaarxiv icon

FLASepformer: Efficient Speech Separation with Gated Focused Linear Attention Transformer

Add code
Aug 27, 2025
Viaarxiv icon

Exploring Efficient Directional and Distance Cues for Regional Speech Separation

Add code
Aug 11, 2025
Viaarxiv icon

SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training

Add code
May 16, 2025
Viaarxiv icon