Picture for Pengfei Liu

Pengfei Liu

InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research

Add code
Nov 03, 2025
Figure 1 for InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
Figure 2 for InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
Figure 3 for InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
Figure 4 for InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
Viaarxiv icon

Context Engineering 2.0: The Context of Context Engineering

Add code
Oct 30, 2025
Figure 1 for Context Engineering 2.0: The Context of Context Engineering
Figure 2 for Context Engineering 2.0: The Context of Context Engineering
Figure 3 for Context Engineering 2.0: The Context of Context Engineering
Figure 4 for Context Engineering 2.0: The Context of Context Engineering
Viaarxiv icon

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Add code
Sep 11, 2025
Viaarxiv icon

Proximal Supervised Fine-Tuning

Add code
Aug 25, 2025
Figure 1 for Proximal Supervised Fine-Tuning
Figure 2 for Proximal Supervised Fine-Tuning
Figure 3 for Proximal Supervised Fine-Tuning
Figure 4 for Proximal Supervised Fine-Tuning
Viaarxiv icon

DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery

Add code
Aug 09, 2025
Viaarxiv icon

AlphaGo Moment for Model Architecture Discovery

Add code
Jul 24, 2025
Figure 1 for AlphaGo Moment for Model Architecture Discovery
Figure 2 for AlphaGo Moment for Model Architecture Discovery
Figure 3 for AlphaGo Moment for Model Architecture Discovery
Figure 4 for AlphaGo Moment for Model Architecture Discovery
Viaarxiv icon

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Add code
Jul 22, 2025
Figure 1 for MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Figure 2 for MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Figure 3 for MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Figure 4 for MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning
Viaarxiv icon

Thinking with Generated Images

Add code
May 28, 2025
Viaarxiv icon

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

Add code
May 25, 2025
Viaarxiv icon

One RL to See Them All: Visual Triple Unified Reinforcement Learning

Add code
May 23, 2025
Figure 1 for One RL to See Them All: Visual Triple Unified Reinforcement Learning
Figure 2 for One RL to See Them All: Visual Triple Unified Reinforcement Learning
Figure 3 for One RL to See Them All: Visual Triple Unified Reinforcement Learning
Figure 4 for One RL to See Them All: Visual Triple Unified Reinforcement Learning
Viaarxiv icon