Picture for Pengfei Liu

Pengfei Liu

Proximal Supervised Fine-Tuning

Add code
Aug 25, 2025
Viaarxiv icon

DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery

Add code
Aug 09, 2025
Viaarxiv icon

AlphaGo Moment for Model Architecture Discovery

Add code
Jul 24, 2025
Viaarxiv icon

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Add code
Jul 22, 2025
Viaarxiv icon

Thinking with Generated Images

Add code
May 28, 2025
Viaarxiv icon

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

Add code
May 25, 2025
Viaarxiv icon

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon

Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States

Add code
May 23, 2025
Viaarxiv icon

DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models

Add code
May 20, 2025
Viaarxiv icon

Efficient Agent Training for Computer Use

Add code
May 20, 2025
Viaarxiv icon