reinforcement learning


Dynamical Priors as a Training Objective in Reinforcement Learning

Add code
Apr 23, 2026
Viaarxiv icon

Task-specific Subnetwork Discovery in Reinforcement Learning for Autonomous Underwater Navigation

Add code
Apr 23, 2026
Viaarxiv icon

Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning

Add code
Apr 23, 2026
Viaarxiv icon

Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding

Add code
Apr 23, 2026
Viaarxiv icon

ReaGeo: Reasoning-Enhanced End-to-End Geocoding with LLMs

Add code
Apr 23, 2026
Viaarxiv icon

CAP: Controllable Alignment Prompting for Unlearning in LLMs

Add code
Apr 23, 2026
Viaarxiv icon

Reinforcing 3D Understanding in Point-VLMs via Geometric Reward Credit Assignment

Add code
Apr 23, 2026
Viaarxiv icon

Learn Weightlessness: Imitate Non-Self-Stabilizing Motions on Humanoid Robot

Add code
Apr 23, 2026
Viaarxiv icon

X2-N: A Transformable Wheel-legged Humanoid Robot with Dual-mode Locomotion and Manipulation

Add code
Apr 23, 2026
Viaarxiv icon

Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models

Add code
Apr 23, 2026
Viaarxiv icon