Qin-Wen Luo

Compress the Easy, Explore the Hard: Difficulty-Aware Entropy Regularization for Efficient LLM Reasoning

Feb 26, 2026
Via arXiv

Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL

May 26, 2025
Via arXiv

Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL

Dec 25, 2024
Via arXiv