reinforcement learning


Integrating Trajectory Optimization and Reinforcement Learning for Quadrupedal Jumping with Terrain-Adaptive Landing

Add code
Sep 16, 2025
Viaarxiv icon

Perception Before Reasoning: Two-Stage Reinforcement Learning for Visual Reasoning in Vision-Language Models

Add code
Sep 16, 2025
Viaarxiv icon

Pre-trained Visual Representations Generalize Where it Matters in Model-Based Reinforcement Learning

Add code
Sep 16, 2025
Viaarxiv icon

GRATE: a Graph transformer-based deep Reinforcement learning Approach for Time-efficient autonomous robot Exploration

Add code
Sep 16, 2025
Viaarxiv icon

Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use

Add code
Sep 16, 2025
Viaarxiv icon

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Add code
Sep 16, 2025
Viaarxiv icon

The Anatomy of Alignment: Decomposing Preference Optimization by Steering Sparse Features

Add code
Sep 16, 2025
Viaarxiv icon

Safe Reinforcement Learning using Action Projection: Safeguard the Policy or the Environment?

Add code
Sep 16, 2025
Viaarxiv icon

Towards Context-Aware Human-like Pointing Gestures with RL Motion Imitation

Add code
Sep 16, 2025
Viaarxiv icon

Empowering Multi-Robot Cooperation via Sequential World Models

Add code
Sep 16, 2025
Viaarxiv icon