Picture for Gergely Neu

Gergely Neu

Generalization bounds for mixing processes via delayed online-to-PAC conversions

Add code
Jun 18, 2024
Viaarxiv icon

Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently

Add code
Jun 06, 2024
Viaarxiv icon

Offline RL via Feature-Occupancy Gradient Ascent

Add code
May 22, 2024
Viaarxiv icon

Optimisic Information Directed Sampling

Add code
Feb 23, 2024
Viaarxiv icon

Dealing with unbounded gradients in stochastic saddle-point optimization

Add code
Feb 21, 2024
Viaarxiv icon

Adversarial Contextual Bandits Go Kernelized

Add code
Oct 02, 2023
Viaarxiv icon

Importance-Weighted Offline Learning Done Right

Add code
Sep 27, 2023
Figure 1 for Importance-Weighted Offline Learning Done Right
Figure 2 for Importance-Weighted Offline Learning Done Right
Figure 3 for Importance-Weighted Offline Learning Done Right
Viaarxiv icon

Online-to-PAC Conversions: Generalization Bounds via Regret Analysis

Add code
May 31, 2023
Viaarxiv icon

Offline Primal-Dual Reinforcement Learning for Linear MDPs

Add code
May 22, 2023
Figure 1 for Offline Primal-Dual Reinforcement Learning for Linear MDPs
Viaarxiv icon

First- and Second-Order Bounds for Adversarial Linear Contextual Bandits

Add code
May 01, 2023
Viaarxiv icon