Picture for Hao Wang

Hao Wang

Xidian University, China

Offline Policy Evaluation for Manipulation Policies via Discounted Liveness Formulation

Add code
May 12, 2026
Viaarxiv icon

Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack

Add code
May 12, 2026
Viaarxiv icon

StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning

Add code
May 12, 2026
Viaarxiv icon

BabelDOC: Better Layout-Preserving PDF Translation via Intermediate Representation

Add code
May 11, 2026
Viaarxiv icon

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

LoopVLA: Learning Sufficiency in Recurrent Refinement for Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

Efficient Serving for Dynamic Agent Workflows with Prediction-based KV-Cache Management

Add code
May 07, 2026
Viaarxiv icon

MaMi-HOI: Harmonizing Global Kinematics and Local Geometry for Human-Object Interaction Generation

Add code
May 07, 2026
Viaarxiv icon

MotionGRPO: Overcoming Low Intra-Group Diversity in GRPO-Based Egocentric Motion Recovery

Add code
May 07, 2026
Viaarxiv icon

Optimal Transport for LLM Reward Modeling from Noisy Preference

Add code
May 07, 2026
Viaarxiv icon