Picture for Vincent Y. F. Tan

Vincent Y. F. Tan

p-Mean Regret for Stochastic Bandits

Add code
Dec 14, 2024
Figure 1 for p-Mean Regret for Stochastic Bandits
Figure 2 for p-Mean Regret for Stochastic Bandits
Viaarxiv icon

Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning

Add code
Oct 15, 2024
Figure 1 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 2 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 3 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Figure 4 for Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning
Viaarxiv icon

Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits

Add code
Oct 10, 2024
Viaarxiv icon

Stochastic Bandits for Egalitarian Assignment

Add code
Oct 08, 2024
Figure 1 for Stochastic Bandits for Egalitarian Assignment
Figure 2 for Stochastic Bandits for Egalitarian Assignment
Figure 3 for Stochastic Bandits for Egalitarian Assignment
Figure 4 for Stochastic Bandits for Egalitarian Assignment
Viaarxiv icon

Best Arm Identification with Minimal Regret

Add code
Sep 27, 2024
Viaarxiv icon

A General Framework for Clustering and Distribution Matching with Bandit Feedback

Add code
Sep 08, 2024
Figure 1 for A General Framework for Clustering and Distribution Matching with Bandit Feedback
Figure 2 for A General Framework for Clustering and Distribution Matching with Bandit Feedback
Figure 3 for A General Framework for Clustering and Distribution Matching with Bandit Feedback
Figure 4 for A General Framework for Clustering and Distribution Matching with Bandit Feedback
Viaarxiv icon

A Sample Efficient Alternating Minimization-based Algorithm For Robust Phase Retrieval

Add code
Sep 07, 2024
Figure 1 for A Sample Efficient Alternating Minimization-based Algorithm For Robust Phase Retrieval
Figure 2 for A Sample Efficient Alternating Minimization-based Algorithm For Robust Phase Retrieval
Figure 3 for A Sample Efficient Alternating Minimization-based Algorithm For Robust Phase Retrieval
Viaarxiv icon

LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization

Add code
Aug 12, 2024
Figure 1 for LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization
Figure 2 for LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization
Figure 3 for LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization
Figure 4 for LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization
Viaarxiv icon

A Mirror Descent-Based Algorithm for Corruption-Tolerant Distributed Gradient Descent

Add code
Jul 19, 2024
Viaarxiv icon

Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback

Add code
Jun 18, 2024
Figure 1 for Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback
Viaarxiv icon