Zhipeng Wang

PACED: Distillation and Self-Distillation at the Frontier of Student Competence

Mar 16, 2026
Via arXiv

PACED: Distillation at the Frontier of Student Competence

Mar 11, 2026
Via arXiv

Dual-Horizon Hybrid Internal Model for Low-Gravity Quadrupedal Jumping with Hardware-in-the-Loop Validation

Mar 09, 2026
Via arXiv

On-Policy Self-Distillation for Reasoning Compression

Mar 05, 2026
Via arXiv

OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport

Feb 25, 2026
Via arXiv

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Feb 24, 2026
Via arXiv

Morphogenetic Assembly and Adaptive Control for Heterogeneous Modular Robots

Feb 11, 2026
Via arXiv

Bayesian Preference Learning for Test-Time Steerable Reward Models

Feb 09, 2026
Via arXiv

Semantic Search At LinkedIn

Feb 07, 2026
Via arXiv

Scaling In-Context Online Learning Capability of LLMs via Cross-Episode Meta-RL

Feb 03, 2026
Via arXiv