Alert button
Picture for Gellért Weisz

Gellért Weisz

Alert button

Online RL in Linearly $q^π$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore

Add code
Bookmark button
Alert button
Oct 11, 2023
Gellért Weisz, András György, Csaba Szepesvári

Viaarxiv icon

Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL

Add code
Bookmark button
Alert button
May 18, 2023
Qinghua Liu, Gellért Weisz, András György, Chi Jin, Csaba Szepesvári

Viaarxiv icon

Exponential Hardness of Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Feb 25, 2023
Daniel Kane, Sihan Liu, Shachar Lovett, Gaurav Mahajan, Csaba Szepesvári, Gellért Weisz

Figure 1 for Exponential Hardness of Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Confident Approximate Policy Iteration for Efficient Local Planning in $q^π$-realizable MDPs

Add code
Bookmark button
Alert button
Oct 27, 2022
Gellért Weisz, András György, Tadashi Kozuno, Csaba Szepesvári

Figure 1 for Confident Approximate Policy Iteration for Efficient Local Planning in $q^π$-realizable MDPs
Viaarxiv icon

TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions

Add code
Bookmark button
Alert button
Oct 05, 2021
Gellért Weisz, Csaba Szepesvári, András György

Figure 1 for TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions
Figure 2 for TensorPlan and the Few Actions Lower Bound for Planning in MDPs under Linear Realizability of Optimal Value Functions
Viaarxiv icon

LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration

Add code
Bookmark button
Alert button
Jul 02, 2018
Gellért Weisz, András György, Csaba Szepesvári

Figure 1 for LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
Figure 2 for LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
Figure 3 for LeapsAndBounds: A Method for Approximately Optimal Algorithm Configuration
Viaarxiv icon

Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces

Add code
Bookmark button
Alert button
Feb 11, 2018
Gellért Weisz, Paweł Budzianowski, Pei-Hao Su, Milica Gašić

Figure 1 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Figure 2 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Figure 3 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Figure 4 for Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Viaarxiv icon