João Sacramento

Discovering modular solutions that generalize compositionally

Dec 22, 2023
Simon Schug, Seijin Kobayashi, Yassir Akram, Maciej Wołczyk, Alexandra Proca, Johannes von Oswald, Razvan Pascanu, João Sacramento, Angelika Steger

Uncovering mesa-optimization algorithms in Transformers

Sep 11, 2023
Johannes von Oswald, Eyvind Niklasson, Maximilian Schlegel, Seijin Kobayashi, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu, João Sacramento

Gated recurrent neural networks discover attention

Sep 04, 2023
Nicolas Zucchet, Seijin Kobayashi, Yassir Akram, Johannes von Oswald, Maxime Larcher, Angelika Steger, João Sacramento

Online learning of long-range dependencies

May 25, 2023
Nicolas Zucchet, Robert Meier, Simon Schug, Asier Mujika, João Sacramento

Transformers learn in-context by gradient descent

Dec 15, 2022
Johannes von Oswald, Eyvind Niklasson, Ettore Randazzo, João Sacramento, Alexander Mordvintsev, Andrey Zhmoginov, Max Vladymyrov

The least-control principle for learning at equilibrium

Jul 04, 2022
Alexander Meulemans, Nicolas Zucchet, Seijin Kobayashi, Johannes von Oswald, João Sacramento

Beyond backpropagation: implicit gradients for bilevel optimization

May 06, 2022
Nicolas Zucchet, João Sacramento

Minimizing Control for Credit Assignment with Strong Feedback

Apr 14, 2022
Alexander Meulemans, Matilde Tristany Farinha, Maria R. Cervera, João Sacramento, Benjamin F. Grewe

Learning where to learn: Gradient sparsity in meta and continual learning

Oct 27, 2021
Johannes von Oswald, Dominic Zhao, Seijin Kobayashi, Simon Schug, Massimo Caccia, Nicolas Zucchet, João Sacramento

Credit Assignment in Neural Networks through Deep Feedback Control

Jun 15, 2021
Alexander Meulemans, Matilde Tristany Farinha, Javier García Ordóñez, Pau Vilimelis Aceituno, João Sacramento, Benjamin F. Grewe
