Gellért Weisz

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability

May 27, 2024
Via arXiv

Online RL in Linearly $q^π$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore

Oct 11, 2023
Via arXiv

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

May 18, 2023
Via arXiv

Exponential Hardness of Reinforcement Learning with Linear Function Approximation

Feb 25, 2023
Via arXiv

Confident Approximate Policy Iteration for Efficient Local Planning in $q^π$-realizable MDPs

Oct 27, 2022
Via arXiv

TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions

Oct 05, 2021
Via arXiv

LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration

Jul 02, 2018
Via arXiv

Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

Feb 11, 2018
Via arXiv