Picture for Haozhe Wang

Haozhe Wang

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Add code
Jan 23, 2026
Viaarxiv icon

CogDoc: Towards Unified thinking in Documents

Add code
Dec 14, 2025
Viaarxiv icon

A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports

Add code
Oct 02, 2025
Viaarxiv icon

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Add code
Sep 03, 2025
Figure 1 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Figure 2 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Figure 3 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Figure 4 for Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
Viaarxiv icon

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Add code
May 26, 2025
Viaarxiv icon

URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning

Add code
May 26, 2025
Figure 1 for URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning
Figure 2 for URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning
Figure 3 for URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning
Figure 4 for URPlanner: A Universal Paradigm For Collision-Free Robotic Motion Planning Based on Deep Reinforcement Learning
Viaarxiv icon

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Add code
May 26, 2025
Figure 1 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 2 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 3 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Figure 4 for StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
Viaarxiv icon

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Add code
May 23, 2025
Figure 1 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 2 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 3 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Figure 4 for Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL
Viaarxiv icon

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Add code
May 21, 2025
Figure 1 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Figure 2 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Figure 3 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Figure 4 for Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Viaarxiv icon

Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials

Add code
Apr 14, 2025
Figure 1 for Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials
Figure 2 for Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials
Figure 3 for Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials
Figure 4 for Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials
Viaarxiv icon