Picture for Alexander Meulemans

Alexander Meulemans

Institute of Neuroinformatics, ETH Zürich and University of Zürich, Zürich, Switzerland

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Add code
Dec 24, 2025
Viaarxiv icon

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

Add code
Jun 05, 2025
Viaarxiv icon

Multi-agent cooperation through learning-aware policy gradients

Add code
Oct 24, 2024
Figure 1 for Multi-agent cooperation through learning-aware policy gradients
Figure 2 for Multi-agent cooperation through learning-aware policy gradients
Figure 3 for Multi-agent cooperation through learning-aware policy gradients
Figure 4 for Multi-agent cooperation through learning-aware policy gradients
Viaarxiv icon

Structured Entity Extraction Using Large Language Models

Add code
Feb 06, 2024
Figure 1 for Structured Entity Extraction Using Large Language Models
Figure 2 for Structured Entity Extraction Using Large Language Models
Figure 3 for Structured Entity Extraction Using Large Language Models
Figure 4 for Structured Entity Extraction Using Large Language Models
Viaarxiv icon

Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis

Add code
Jun 29, 2023
Figure 1 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Figure 2 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Figure 3 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Figure 4 for Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Viaarxiv icon

The least-control principle for learning at equilibrium

Add code
Jul 04, 2022
Figure 1 for The least-control principle for learning at equilibrium
Figure 2 for The least-control principle for learning at equilibrium
Figure 3 for The least-control principle for learning at equilibrium
Figure 4 for The least-control principle for learning at equilibrium
Viaarxiv icon

Minimizing Control for Credit Assignment with Strong Feedback

Add code
Apr 14, 2022
Figure 1 for Minimizing Control for Credit Assignment with Strong Feedback
Figure 2 for Minimizing Control for Credit Assignment with Strong Feedback
Figure 3 for Minimizing Control for Credit Assignment with Strong Feedback
Figure 4 for Minimizing Control for Credit Assignment with Strong Feedback
Viaarxiv icon

Credit Assignment in Neural Networks through Deep Feedback Control

Add code
Jun 15, 2021
Figure 1 for Credit Assignment in Neural Networks through Deep Feedback Control
Figure 2 for Credit Assignment in Neural Networks through Deep Feedback Control
Figure 3 for Credit Assignment in Neural Networks through Deep Feedback Control
Figure 4 for Credit Assignment in Neural Networks through Deep Feedback Control
Viaarxiv icon

Challenges for Using Impact Regularizers to Avoid Negative Side Effects

Add code
Feb 23, 2021
Viaarxiv icon

A Theoretical Framework for Target Propagation

Add code
Jun 25, 2020
Figure 1 for A Theoretical Framework for Target Propagation
Figure 2 for A Theoretical Framework for Target Propagation
Figure 3 for A Theoretical Framework for Target Propagation
Figure 4 for A Theoretical Framework for Target Propagation
Viaarxiv icon