Picture for Renhong Chen

Renhong Chen

TreeAdv: Tree-Structured Advantage Redistribution for Group-Based RL

Add code
Jan 07, 2026
Viaarxiv icon

Process Reward Modeling with Entropy-Driven Uncertainty

Add code
Mar 28, 2025
Figure 1 for Process Reward Modeling with Entropy-Driven Uncertainty
Figure 2 for Process Reward Modeling with Entropy-Driven Uncertainty
Figure 3 for Process Reward Modeling with Entropy-Driven Uncertainty
Figure 4 for Process Reward Modeling with Entropy-Driven Uncertainty
Viaarxiv icon