Q Learning


Learning to Play Blackjack: A Curriculum Learning Perspective

Add code
Apr 02, 2026
Viaarxiv icon

Learn by Surprise, Commit by Proof

Add code
Apr 02, 2026
Viaarxiv icon

Residuals-based Offline Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon

Full-Gradient Successor Feature Representations

Add code
Apr 01, 2026
Viaarxiv icon

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon

Coupled Query-Key Dynamics for Attention

Add code
Apr 02, 2026
Viaarxiv icon

Soft MPCritic: Amortized Model Predictive Value Iteration

Add code
Apr 01, 2026
Viaarxiv icon

Learning Compact Terrain-Context Representations for Feasibility-Aware Offline Reinforcement Learning in UAV Relaying Networks

Add code
Mar 31, 2026
Viaarxiv icon

Activation Saturation and Floquet Spectrum Collapse in Neural ODEs

Add code
Apr 01, 2026
Viaarxiv icon

Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search with EEG Data

Add code
Mar 30, 2026
Viaarxiv icon