Shijue Huang

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

May 11, 2026
Via arXiv

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Apr 20, 2026
Via arXiv

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Feb 26, 2026
Via arXiv

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

Jan 30, 2026
Via arXiv

Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

Nov 12, 2025
Via arXiv

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting

May 24, 2025
Via arXiv

Self-Reasoning Language Models: Unfold Hidden Reasoning Chains with Few Reasoning Catalyst

May 20, 2025
Via arXiv

OTC: Optimal Tool Calls via Reinforcement Learning

Apr 21, 2025
Via arXiv

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Apr 15, 2025
Via arXiv

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Jan 21, 2025
Via arXiv