Haozhe Wang

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Mar 17, 2026
Via arXiv

Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents

Mar 11, 2026
Via arXiv

Unified Structural-Hydrodynamic Modeling of Underwater Underactuated Mechanisms and Soft Robots

Mar 09, 2026
Via arXiv

Physical Human-Robot Interaction for Grasping in Augmented Reality via Rigid-Soft Robot Synergy

Feb 19, 2026
Via arXiv

Asynchronous Verified Semantic Caching for Tiered LLM Architectures

Feb 13, 2026
Via arXiv

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Jan 23, 2026
Via arXiv

CogDoc: Towards Unified thinking in Documents

Dec 14, 2025
Via arXiv

A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports

Oct 02, 2025
Via arXiv

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Sep 03, 2025
Via arXiv

URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning

May 26, 2025
Via arXiv