Picture for Yan Dai

Yan Dai

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

Add code
Feb 11, 2024
Viaarxiv icon

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Add code
Feb 02, 2024
Viaarxiv icon

The Crucial Role of Normalization in Sharpness-Aware Minimization

Add code
May 24, 2023
Figure 1 for The Crucial Role of Normalization in Sharpness-Aware Minimization
Figure 2 for The Crucial Role of Normalization in Sharpness-Aware Minimization
Figure 3 for The Crucial Role of Normalization in Sharpness-Aware Minimization
Figure 4 for The Crucial Role of Normalization in Sharpness-Aware Minimization
Viaarxiv icon

Refined Regret for Adversarial MDPs with Linear Function Approximation

Add code
Jan 30, 2023
Figure 1 for Refined Regret for Adversarial MDPs with Linear Function Approximation
Viaarxiv icon

Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning

Add code
Jan 25, 2023
Figure 1 for Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning
Viaarxiv icon

Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Add code
Jun 30, 2022
Figure 1 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 2 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 3 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 4 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Viaarxiv icon

Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback

Add code
May 26, 2022
Figure 1 for Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback
Viaarxiv icon

Variance-Aware Sparse Linear Bandits

Add code
May 26, 2022
Figure 1 for Variance-Aware Sparse Linear Bandits
Viaarxiv icon

Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits

Add code
Jan 28, 2022
Figure 1 for Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits
Viaarxiv icon

Scale-Free Adversarial Multi-Armed Bandit with Arbitrary Feedback Delays

Add code
Oct 26, 2021
Figure 1 for Scale-Free Adversarial Multi-Armed Bandit with Arbitrary Feedback Delays
Viaarxiv icon