Huazheng Wang

Eugene

Online KL-Regularized Reinforcement Learning with Function Approximation under Misspecification

Jun 04, 2026
Via arXiv

EvoPool: Evolutionary Programmatic Annotation for Label-Efficient Specialized Supervision

Jun 01, 2026
Via arXiv

Speculative Pipeline Decoding: Higher-Accuracy and Zero-Bubble Speculation via Pipeline Parallelism

May 29, 2026
Via arXiv

When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs

May 22, 2026
Via arXiv

MetaAgent-X: Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

May 14, 2026
Via arXiv

Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

May 12, 2026
Via arXiv

When Can You Poison Rewards? A Tight Characterization of Reward Poisoning in Linear MDPs

Apr 11, 2026
Via arXiv

Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio

Mar 26, 2026
Via arXiv

Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback

Feb 02, 2026
Via arXiv

Sliding Window Attention Adaptation

Dec 16, 2025
Via arXiv