Picture for Yingfan MA

Yingfan MA

Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots

Add code
Jun 03, 2026
Viaarxiv icon

GeoMin: Data-Efficient Semi-Supervised RLVR via Geometric Distribution Modeling

Add code
Jun 03, 2026
Viaarxiv icon

TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning

Add code
Dec 15, 2025
Figure 1 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 2 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 3 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 4 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Viaarxiv icon