Picture for Zhengxi Lu

Zhengxi Lu

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

Add code
Jun 25, 2026
Viaarxiv icon

Finding the Evidence: Discovering Decision-Supporting Tokens for On-Policy Reasoning Distillation

Add code
Jun 22, 2026
Viaarxiv icon

GUI-CIDER: Mid-training GUI Agents via Causal Internalization and Density-aware Exemplar Reselection

Add code
May 27, 2026
Viaarxiv icon

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

Add code
May 21, 2026
Viaarxiv icon

Self-Distilled Agentic Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Add code
May 07, 2026
Viaarxiv icon

UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding

Add code
Apr 15, 2026
Viaarxiv icon

UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization

Add code
Apr 15, 2026
Viaarxiv icon

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Add code
Apr 09, 2026
Viaarxiv icon

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Add code
Apr 02, 2026
Viaarxiv icon