Jieping Ye

University of Michigan, DiDi Chuxing

ESPO: Early-Stopping Proximal Policy Optimization

May 28, 2026
Via arXiv

STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments

May 28, 2026
Via arXiv

MPDocBench-Parse: Benchmarking Practical Multi-page Document Parsing

May 21, 2026
Via arXiv

Are Rationales Necessary and Sufficient? Tuning LLMs for Explainable Misinformation Detection

May 19, 2026
Via arXiv

Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation

May 19, 2026
Via arXiv

Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation

May 13, 2026
Via arXiv

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

May 12, 2026
Via arXiv

On the Step Length Confounding in LLM Reasoning Data Selection

Apr 08, 2026
Via arXiv

AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References

Mar 26, 2026
Via arXiv

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

Mar 03, 2026
Via arXiv