Picture for Runming He

Runming He

One-Eval: An Agentic System for Automated and Traceable LLM Evaluation

Add code
Mar 10, 2026
Viaarxiv icon

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Add code
Dec 18, 2025
Viaarxiv icon

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions

Add code
Jun 09, 2025
Figure 1 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Figure 2 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Figure 3 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Figure 4 for Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
Viaarxiv icon

LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning

Add code
Jun 05, 2025
Viaarxiv icon

Let's Verify Math Questions Step by Step

Add code
May 20, 2025
Viaarxiv icon