Xiaoyang Yuan

RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering

Jan 14, 2026
Via arXiv

More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration

Oct 02, 2025
Via arXiv