Picture for Shenzhi Yang

Shenzhi Yang

OPRD: On-Policy Representation Distillation

Add code
Jun 04, 2026
Viaarxiv icon

GeoMin: Data-Efficient Semi-Supervised RLVR via Geometric Distribution Modeling

Add code
Jun 03, 2026
Viaarxiv icon

Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots

Add code
Jun 03, 2026
Viaarxiv icon

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Add code
Apr 05, 2026
Viaarxiv icon

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Add code
Mar 15, 2026
Viaarxiv icon

TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning

Add code
Dec 15, 2025
Figure 1 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 2 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 3 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Figure 4 for TraPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning
Viaarxiv icon

Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs

Add code
Apr 18, 2025
Viaarxiv icon

NodeReg: Mitigating the Imbalance and Distribution Shift Effects in Semi-Supervised Node Classification via Norm Consistency

Add code
Mar 05, 2025
Figure 1 for NodeReg: Mitigating the Imbalance and Distribution Shift Effects in Semi-Supervised Node Classification via Norm Consistency
Figure 2 for NodeReg: Mitigating the Imbalance and Distribution Shift Effects in Semi-Supervised Node Classification via Norm Consistency
Figure 3 for NodeReg: Mitigating the Imbalance and Distribution Shift Effects in Semi-Supervised Node Classification via Norm Consistency
Figure 4 for NodeReg: Mitigating the Imbalance and Distribution Shift Effects in Semi-Supervised Node Classification via Norm Consistency
Viaarxiv icon

Category-free Out-of-Distribution Node Detection with Feature Resonance

Add code
Feb 22, 2025
Viaarxiv icon

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

Add code
Oct 16, 2024
Figure 1 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 2 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 3 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 4 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Viaarxiv icon