Picture for Xunliang Cai

Xunliang Cai

Alphabetical order by last name

Self-Distilled Agentic Reinforcement Learning

Add code
May 14, 2026
Viaarxiv icon

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Add code
May 13, 2026
Viaarxiv icon

MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

Add code
May 13, 2026
Viaarxiv icon

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level

Add code
May 07, 2026
Viaarxiv icon

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Add code
May 07, 2026
Viaarxiv icon

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Add code
May 04, 2026
Viaarxiv icon

FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control

Add code
Apr 21, 2026
Viaarxiv icon

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

Add code
Apr 20, 2026
Viaarxiv icon

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention

Add code
Apr 15, 2026
Viaarxiv icon

General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

Add code
Apr 13, 2026
Viaarxiv icon