reinforcement learning


Abstraction for Offline Goal-Conditioned Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

Behavior-Consistent Deep Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

Kernel-Based Safe Exploration in Deep Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

Add code
May 21, 2026
Viaarxiv icon

Reinforcement learning for ion shuttling on trapped-ion quantum computers

Add code
May 21, 2026
Viaarxiv icon

From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

Add code
May 21, 2026
Viaarxiv icon

Don't Forget the Critic: Value-Based Data Rehearsal for Multi-Cyclic Continual Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

General Preference Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

Target-Aligned Bellman Backup for Cross-domain Offline Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

Chebyshev Policies and the Mountain Car Problem: Reinforcement Learning for Low-Dimensional Control Tasks

Add code
May 21, 2026
Viaarxiv icon