Picture for Qingping Yang

Qingping Yang

AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning

Add code
May 17, 2025
Viaarxiv icon

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Add code
Mar 31, 2025
Viaarxiv icon

UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts

Add code
Nov 11, 2024
Viaarxiv icon

How to Understand Whole Software Repository?

Add code
Jun 03, 2024
Figure 1 for How to Understand Whole Software Repository?
Figure 2 for How to Understand Whole Software Repository?
Figure 3 for How to Understand Whole Software Repository?
Figure 4 for How to Understand Whole Software Repository?
Viaarxiv icon