Picture for Haozhe Ma

Haozhe Ma

Exploration by Random Reward Perturbation

Add code
Jun 10, 2025
Viaarxiv icon

Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic

Add code
Jun 05, 2025
Viaarxiv icon

JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning

Add code
May 26, 2025
Viaarxiv icon

Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning

Add code
Aug 20, 2024
Viaarxiv icon

Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

Add code
Aug 07, 2024
Viaarxiv icon

Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

Add code
Jul 20, 2024
Viaarxiv icon