Picture for Peng Ye

Peng Ye

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Add code
Dec 30, 2025
Viaarxiv icon

EGM: Efficiently Learning General Motion Tracking Policy for High Dynamic Humanoid Whole-Body Control

Add code
Dec 22, 2025
Viaarxiv icon

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Add code
Dec 18, 2025
Viaarxiv icon

M-GRPO: Stabilizing Self-Supervised Reinforcement Learning for Large Language Models with Momentum-Anchored Policy Optimization

Add code
Dec 15, 2025
Viaarxiv icon

P1: Mastering Physics Olympiads with Reinforcement Learning

Add code
Nov 17, 2025
Viaarxiv icon

Private Online Learning against an Adaptive Adversary: Realizable and Agnostic Settings

Add code
Oct 01, 2025
Viaarxiv icon

Private Realizable-to-Agnostic Transformation with Near-Optimal Sample Complexity

Add code
Oct 01, 2025
Viaarxiv icon

Learning Compact Representations of LLM Abilities via Item Response Theory

Add code
Oct 01, 2025
Figure 1 for Learning Compact Representations of LLM Abilities via Item Response Theory
Figure 2 for Learning Compact Representations of LLM Abilities via Item Response Theory
Figure 3 for Learning Compact Representations of LLM Abilities via Item Response Theory
Figure 4 for Learning Compact Representations of LLM Abilities via Item Response Theory
Viaarxiv icon

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Add code
Sep 10, 2025
Figure 1 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 2 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 3 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 4 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Viaarxiv icon

Beyond GPT-5: Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing

Add code
Aug 18, 2025
Viaarxiv icon