Pengfei Liu

Hybrid Policy Distillation for LLMs

Apr 22, 2026
Via arXiv

AlphaEval: Evaluating Agents in Production

Apr 14, 2026
Via arXiv

SepSeq: A Training-Free Framework for Long Numerical Sequence Processing in LLMs

Apr 09, 2026
Via arXiv

Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks

Apr 03, 2026
Via arXiv

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Apr 02, 2026
Via arXiv

ASI-Evolve: AI Accelerates AI

Mar 31, 2026
Via arXiv

CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains

Mar 30, 2026
Via arXiv

PRBench: End-to-end Paper Reproduction in Physics Research

Mar 29, 2026
Via arXiv

daVinci-LLM: Towards the Science of Pretraining

Mar 28, 2026
Via arXiv

Caption Generation for Dongba Paintings via Prompt Learning and Semantic Fusion

Mar 24, 2026
Via arXiv