Tanmay Ambadkar

Preference Conditioned Multi-Objective Reinforcement Learning: Decomposed, Diversity-Driven Policy Optimization

Feb 08, 2026
Via arXiv