Picture for Csaba Szepesvári

Csaba Szepesvári

Sharper Guarantees for Misspecified Kernelized Bandit Optimization

Add code
May 07, 2026
Viaarxiv icon

To See the Unseen: on the Generalization Ability of Transformers in Symbolic Reasoning

Add code
Apr 23, 2026
Viaarxiv icon

LACONIC: Length-Aware Constrained Reinforcement Learning for LLM

Add code
Feb 16, 2026
Viaarxiv icon

Sharp analysis of linear ensemble sampling

Add code
Feb 08, 2026
Viaarxiv icon

Efficient Simple Regret Algorithms for Stochastic Contextual Bandits

Add code
Jan 29, 2026
Viaarxiv icon

Eluder dimension: localise it!

Add code
Jan 14, 2026
Viaarxiv icon

Frontier LLMs Still Struggle with Simple Reasoning Tasks

Add code
Jul 09, 2025
Figure 1 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Figure 2 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Figure 3 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Figure 4 for Frontier LLMs Still Struggle with Simple Reasoning Tasks
Viaarxiv icon

Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits

Add code
Oct 01, 2024
Viaarxiv icon

Confident Natural Policy Gradient for Local Planning in $q_π$-realizable Constrained MDPs

Add code
Jun 26, 2024
Viaarxiv icon

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability

Add code
May 27, 2024
Viaarxiv icon