Multi Objective Reinforcement Learning


PEARL: Training Socratic Tutors with Pedagogically Aligned Reinforcement Learning

Add code
May 28, 2026
Viaarxiv icon

Learning Design Skills as Memory Policies for Agentic Photonic Inverse Design

Add code
May 28, 2026
Viaarxiv icon

Train the Agent, Not the Expert: Learning to Harness Heterogeneous Experts for Multi-Turn Visual Reasoning

Add code
May 28, 2026
Viaarxiv icon

On Distributional Reinforcement Learning in Chaotic Dynamical Systems

Add code
May 28, 2026
Viaarxiv icon

LLM-Guided Future Hypotheses for Horizon-Aware Exploration in Multi-Step Robot Manipulation

Add code
May 28, 2026
Viaarxiv icon

EvoRubric: Self-Evolving Rubric-Driven RL for Open-Ended Generation

Add code
May 28, 2026
Viaarxiv icon

FoundObj: Self-supervised Foundation Models as Rewards for Label-free 3D Object Segmentation

Add code
May 26, 2026
Viaarxiv icon

ProgVLA: Progress-Aware Robot Manipulation Skill Learning

Add code
May 27, 2026
Viaarxiv icon

Beyond pass@k: Redundancy-Aware RLVR for Multi-Sample Code Generation

Add code
May 27, 2026
Viaarxiv icon

Differentiable Belief-based Opponent Shaping

Add code
May 27, 2026
Viaarxiv icon