Model Based Reinforcement Learning


Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

Add code
May 22, 2025
Viaarxiv icon

Gaze Into the Abyss -- Planning to Seek Entropy When Reward is Scarce

Add code
May 22, 2025
Viaarxiv icon

SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning

Add code
May 22, 2025
Viaarxiv icon

KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning

Add code
May 22, 2025
Viaarxiv icon

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Add code
May 22, 2025
Viaarxiv icon

VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving

Add code
May 22, 2025
Viaarxiv icon

Interactive Post-Training for Vision-Language-Action Models

Add code
May 22, 2025
Viaarxiv icon

Improving planning and MBRL with temporally-extended actions

Add code
May 21, 2025
Viaarxiv icon

Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies

Add code
May 22, 2025
Viaarxiv icon

Distilling the Implicit Multi-Branch Structure in LLMs' Reasoning via Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon