Picture for Xiefeng Wu

Xiefeng Wu

Off-Policy Actor-Critic with Sigmoid-Bounded Entropy for Real-World Robot Learning

Add code
Jan 22, 2026
Viaarxiv icon

From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge

Add code
Oct 02, 2024
Viaarxiv icon

Enhancing Q-Learning with Large Language Model Heuristics

Add code
May 06, 2024
Viaarxiv icon