Dongruo Zhou

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits

Mar 15, 2024

DPAdapter: Improving Differentially Private Deep Learning through Noise Tolerance Pre-training

Mar 05, 2024

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Feb 14, 2024

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

Nov 23, 2023

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

Feb 21, 2023

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Dec 12, 2022

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

Aug 10, 2022

Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs

May 23, 2022

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions

May 13, 2022

Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds

Feb 28, 2022