Jürgen Schmidhuber

Learning Useful Representations of Recurrent Neural Network Weight Matrices

Mar 18, 2024
Vincent Herrmann, Francesco Faccio, Jürgen Schmidhuber

Via arXiv

Language Agents as Optimizable Graphs

Feb 27, 2024
Mingchen Zhuge, Wenyi Wang, Louis Kirsch, Francesco Faccio, Dmitrii Khizbullin, Jürgen Schmidhuber

Via arXiv

SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention

Dec 14, 2023
Róbert Csordás, Piotr Piękos, Kazuki Irie, Jürgen Schmidhuber

Via arXiv

Automating Continual Learning

Dec 01, 2023
Kazuki Irie, Róbert Csordás, Jürgen Schmidhuber

Via arXiv

Efficient Rotation Invariance in Deep Neural Networks through Artificial Mental Rotation

Nov 14, 2023
Lukas Tuggener, Thilo Stadelmann, Jürgen Schmidhuber

Via arXiv

Unsupervised Musical Object Discovery from Audio

Nov 14, 2023
Joonsu Gha, Vincent Herrmann, Benjamin Grewe, Jürgen Schmidhuber, Anand Gopalakrishnan

Via arXiv

Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions

Oct 24, 2023
Kazuki Irie, Róbert Csordás, Jürgen Schmidhuber

Via arXiv

Approximating Two-Layer Feedforward Networks for Efficient Transformers

Oct 23, 2023
Róbert Csordás, Kazuki Irie, Jürgen Schmidhuber

Via arXiv

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Sep 20, 2023
Aleksandar Stanić, Dylan Ashley, Oleg Serikov, Louis Kirsch, Francesco Faccio, Jürgen Schmidhuber, Thomas Hofmann, Imanol Schlag

Via arXiv