Picture for Pengfei Liu

Pengfei Liu

Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States

Add code
May 23, 2025
Viaarxiv icon

Efficient Agent Training for Computer Use

Add code
May 20, 2025
Figure 1 for Efficient Agent Training for Computer Use
Figure 2 for Efficient Agent Training for Computer Use
Figure 3 for Efficient Agent Training for Computer Use
Figure 4 for Efficient Agent Training for Computer Use
Viaarxiv icon

DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models

Add code
May 20, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Figure 1 for Seed1.5-VL Technical Report
Figure 2 for Seed1.5-VL Technical Report
Figure 3 for Seed1.5-VL Technical Report
Figure 4 for Seed1.5-VL Technical Report
Viaarxiv icon

Generative AI Act II: Test Time Scaling Drives Cognition Engineering

Add code
Apr 21, 2025
Figure 1 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Figure 2 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Figure 3 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Figure 4 for Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Viaarxiv icon

DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments

Add code
Apr 07, 2025
Figure 1 for DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
Figure 2 for DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
Figure 3 for DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
Figure 4 for DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments
Viaarxiv icon

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Add code
Apr 03, 2025
Viaarxiv icon

ToRL: Scaling Tool-Integrated RL

Add code
Mar 30, 2025
Viaarxiv icon

RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing

Add code
Mar 10, 2025
Figure 1 for RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing
Figure 2 for RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing
Figure 3 for RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing
Figure 4 for RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing
Viaarxiv icon

LIMR: Less is More for RL Scaling

Add code
Feb 17, 2025
Viaarxiv icon