Tian-Shuo Liu

Off-Policy Value-Based Reinforcement Learning for Large Language Models

Mar 24, 2026
Via arXiv

Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning

Jul 17, 2024
Via arXiv