Picture for Xiaoxia Wu

Xiaoxia Wu

Search Your Block Floating Point Scales!

Add code
May 12, 2026
Viaarxiv icon

SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving

Add code
Apr 21, 2026
Viaarxiv icon

Introspective Diffusion Language Models

Add code
Apr 13, 2026
Viaarxiv icon

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Add code
Apr 09, 2026
Viaarxiv icon

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention

Add code
Mar 18, 2026
Viaarxiv icon

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Add code
Mar 04, 2026
Viaarxiv icon

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Add code
Feb 06, 2026
Viaarxiv icon

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Add code
Dec 31, 2025
Viaarxiv icon

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

Add code
Nov 17, 2025
Viaarxiv icon

Mojito: Motion Trajectory and Intensity Control for Video Generation

Add code
Dec 12, 2024
Figure 1 for Mojito: Motion Trajectory and Intensity Control for Video Generation
Figure 2 for Mojito: Motion Trajectory and Intensity Control for Video Generation
Figure 3 for Mojito: Motion Trajectory and Intensity Control for Video Generation
Figure 4 for Mojito: Motion Trajectory and Intensity Control for Video Generation
Viaarxiv icon