Pengfei Liu

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Apr 21, 2025
Via arXiv

DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments

Apr 07, 2025
Via arXiv

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Apr 03, 2025
Via arXiv

ToRL: Scaling Tool-Integrated RL

Mar 30, 2025
Via arXiv

RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing

Mar 10, 2025
Via arXiv

LIMR: Less is More for RL Scaling

Feb 17, 2025
Via arXiv

LIMO: Less is More for Reasoning

Feb 05, 2025
Via arXiv

Survey and Improvement Strategies for Gene Prioritization with Large Language Models

Jan 30, 2025
Via arXiv

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Jan 11, 2025
Via arXiv

DIVE: Diversified Iterative Self-Improvement

Jan 01, 2025
Via arXiv