Enlu Zhou

Improving Bayesian Optimization via Training-Aware Conditional Diffusion Models

Jun 07, 2026

Optimal Data Acquisition for Reinforcement Learning: A Large Deviations Perspective

May 27, 2026

Evolving Robustness-Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs

May 23, 2026

Adaptive Simulation Experiment for LLM Policy Optimization

Apr 09, 2026

Curiosity is Knowledge: Self-Consistent Learning and No-Regret Optimization with Active Inference

Feb 05, 2026

Pragmatic Curiosity: A Hybrid Learning-Optimization Paradigm via Active Inference

Feb 05, 2026

Policy Gradient Optimization for Bayesian-Risk MDPs with General Convex Losses

Sep 19, 2025

Online Bayesian Risk-Averse Reinforcement Learning

Sep 17, 2025

Ranking and Selection with Simultaneous Input Data Collection

Mar 14, 2025

Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate

Mar 01, 2024