Picture for Junxian He

Junxian He

Dr.~RTL: Autonomous Agentic RTL Optimization through Tool-Grounded Self-Improvement

Add code
Apr 16, 2026
Viaarxiv icon

E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning

Add code
Apr 10, 2026
Viaarxiv icon

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Add code
Mar 10, 2026
Viaarxiv icon

SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?

Add code
Feb 28, 2026
Viaarxiv icon

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Add code
Feb 26, 2026
Viaarxiv icon

LOCA-bench: Benchmarking Language Agents Under Controllable and Extreme Context Growth

Add code
Feb 08, 2026
Viaarxiv icon

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Add code
Feb 05, 2026
Viaarxiv icon

SWE-RM: Execution-free Feedback For Software Engineering Agents

Add code
Dec 26, 2025
Figure 1 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Figure 2 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Figure 3 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Figure 4 for SWE-RM: Execution-free Feedback For Software Engineering Agents
Viaarxiv icon

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Add code
Oct 29, 2025
Viaarxiv icon

Model-Task Alignment Drives Distinct RL Outcomes

Add code
Aug 28, 2025
Viaarxiv icon