Picture for Guanghui Yang

Guanghui Yang

LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration

Add code
Nov 11, 2025
Viaarxiv icon