Picture for Qiwei Di

Qiwei Di

Near-Optimal Regret for KL-Regularized Multi-Armed Bandits

Add code
Mar 02, 2026
Viaarxiv icon

Dimension-Independent Convergence of Underdamped Langevin Monte Carlo in KL Divergence

Add code
Mar 02, 2026
Viaarxiv icon

Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers

Add code
Oct 18, 2024
Viaarxiv icon

Relative-Translation Invariant Wasserstein Distance

Add code
Sep 04, 2024
Figure 1 for Relative-Translation Invariant Wasserstein Distance
Figure 2 for Relative-Translation Invariant Wasserstein Distance
Figure 3 for Relative-Translation Invariant Wasserstein Distance
Figure 4 for Relative-Translation Invariant Wasserstein Distance
Viaarxiv icon

Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback

Add code
Apr 16, 2024
Figure 1 for Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Figure 2 for Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Figure 3 for Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Viaarxiv icon

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Add code
Feb 14, 2024
Figure 1 for Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Figure 2 for Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Figure 3 for Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
Viaarxiv icon

Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning

Add code
Oct 02, 2023
Figure 1 for Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
Viaarxiv icon

Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits

Add code
Oct 02, 2023
Figure 1 for Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits
Viaarxiv icon