Picture for Maryam Fazel

Maryam Fazel

Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models

Add code
Jun 29, 2024
Figure 1 for Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models
Viaarxiv icon

Offline Multi-task Transfer RL with Representational Penalization

Add code
Feb 19, 2024
Viaarxiv icon

Learning Optimal Tax Design in Nonatomic Congestion Games

Add code
Feb 12, 2024
Viaarxiv icon

Initializing Services in Interactive ML Systems for Diverse Users

Add code
Dec 19, 2023
Viaarxiv icon

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

Add code
Jul 27, 2023
Figure 1 for A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Figure 2 for A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Figure 3 for A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Figure 4 for A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
Viaarxiv icon

A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning

Add code
Jun 12, 2023
Figure 1 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Figure 2 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Figure 3 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Figure 4 for A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
Viaarxiv icon

No-Regret Online Prediction with Strategic Experts

Add code
May 24, 2023
Figure 1 for No-Regret Online Prediction with Strategic Experts
Viaarxiv icon

Stochastic Contextual Bandits with Long Horizon Rewards

Add code
Feb 03, 2023
Figure 1 for Stochastic Contextual Bandits with Long Horizon Rewards
Figure 2 for Stochastic Contextual Bandits with Long Horizon Rewards
Figure 3 for Stochastic Contextual Bandits with Long Horizon Rewards
Figure 4 for Stochastic Contextual Bandits with Long Horizon Rewards
Viaarxiv icon

Offline congestion games: How feedback type affects data coverage requirement

Add code
Oct 24, 2022
Figure 1 for Offline congestion games: How feedback type affects data coverage requirement
Figure 2 for Offline congestion games: How feedback type affects data coverage requirement
Figure 3 for Offline congestion games: How feedback type affects data coverage requirement
Figure 4 for Offline congestion games: How feedback type affects data coverage requirement
Viaarxiv icon

Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies

Add code
Oct 10, 2022
Figure 1 for Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Figure 2 for Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Figure 3 for Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Figure 4 for Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Viaarxiv icon