Picture for Bernhard Schölkopf

Bernhard Schölkopf

MPI-IS

On the Variance of Temporal Difference Learning and its Reduction Using Control Variates

Add code
Jun 18, 2026
Viaarxiv icon

Sensorimotor World Models: Perception for Action via Inverse Dynamics

Add code
Jun 18, 2026
Viaarxiv icon

Direct Advantage Estimation for Scalable and Sample-efficient Deep Reinforcement Learning

Add code
Jun 18, 2026
Viaarxiv icon

Slots, Transitions, Loops: Learning Composable World Models for ARC

Add code
Jun 10, 2026
Viaarxiv icon

Echoes of the Prior: A Computational Phenomenology of Forgetting

Add code
Jun 10, 2026
Viaarxiv icon

PaperMentor: A Human-Centered Multi-Agent Writing Tutor for AI Research Papers on Overleaf

Add code
Jun 07, 2026
Viaarxiv icon

STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations

Add code
Jun 03, 2026
Viaarxiv icon

PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

Add code
May 27, 2026
Viaarxiv icon

Learning to Reason Efficiently with A* Post-Training

Add code
May 23, 2026
Viaarxiv icon

Riemannian Networks over Full-Rank Correlation Matrices

Add code
May 18, 2026
Viaarxiv icon