Picture for Yankai Lin

Yankai Lin

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Add code
Mar 15, 2026
Viaarxiv icon

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Add code
Feb 12, 2026
Viaarxiv icon

AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research

Add code
Feb 06, 2026
Viaarxiv icon

AgentCPM-Explore: Realizing Long-Horizon Deep Exploration for Edge-Scale Agents

Add code
Feb 06, 2026
Viaarxiv icon

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

Add code
Jan 29, 2026
Viaarxiv icon

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Add code
Jan 21, 2026
Viaarxiv icon

AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Add code
Jan 13, 2026
Viaarxiv icon

Forest Before Trees: Latent Superposition for Efficient Visual Reasoning

Add code
Jan 11, 2026
Viaarxiv icon

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Add code
Jun 09, 2025
Viaarxiv icon

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Figure 1 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 2 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 3 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 4 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Viaarxiv icon