Shengtian Yang

Southeast University, Kuaishou Technology

Phase-Aware Mixture of Experts for Agentic Reinforcement Learning

Feb 19, 2026
Via arXiv

PhGPO: Pheromone-Guided Policy Optimization for Long-Horizon Tool Planning

Feb 14, 2026
Via arXiv

MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection

Oct 24, 2025
Via arXiv