Picture for Jieping Ye

Jieping Ye

University of Michigan, DiDi Chuxing

SkillComposer: Learning to Evolve Agent Skills for Specification and Generalization

Add code
Jun 04, 2026
Viaarxiv icon

Scaling Self-Evolving Agents via Parametric Memory

Add code
Jun 03, 2026
Viaarxiv icon

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

Add code
Jun 02, 2026
Viaarxiv icon

STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments

Add code
May 28, 2026
Viaarxiv icon

ESPO: Early-Stopping Proximal Policy Optimization

Add code
May 28, 2026
Viaarxiv icon

MPDocBench-Parse: Benchmarking Practical Multi-page Document Parsing

Add code
May 21, 2026
Viaarxiv icon

Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation

Add code
May 19, 2026
Viaarxiv icon

Are Rationales Necessary and Sufficient? Tuning LLMs for Explainable Misinformation Detection

Add code
May 19, 2026
Viaarxiv icon

Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation

Add code
May 13, 2026
Viaarxiv icon

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

Add code
May 12, 2026
Viaarxiv icon