Qiaomin Xie

A Piecewise Lyapunov Analysis of Sub-quadratic SGD: Applications to Robust and Quantile Regression

Apr 15, 2025

Optimally Installing Strict Equilibria

Mar 05, 2025

Coupling-based Convergence Diagnostic and Stepsize Scheme for Stochastic Gradient Descent

Dec 15, 2024

Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Oct 16, 2024

Stable Offline Value Function Learning with Bisimulation-based Representations

Oct 02, 2024

Inception: Efficiently Computable Misinformation Attacks on Markov Games

Jun 24, 2024

Roping in Uncertainty: Robustness and Regularization in Markov Games

Jun 13, 2024

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

Jun 07, 2024

When is exponential asymptotic optimality achievable in average-reward restless bandits?

May 28, 2024

The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize

May 27, 2024