Picture for Hongxun Ding

Hongxun Ding

Towards Sample-Efficient and Stable Reinforcement Learning for LLM-based Recommendation

Add code
Jan 31, 2026
Viaarxiv icon