Aleksandar Botev

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Apr 11, 2024
Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Armand Joulin, Noah Fiedel, Evan Senter, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, David Budden, Arnaud Doucet, Sharad Vikram, Adam Paszke, Trevor Gale, Sebastian Borgeaud, Charlie Chen, Andy Brock, Antonia Paterson, Jenny Brennan, Meg Risdal, Raj Gundluru, Nesh Devanathan, Paul Mooney, Nilay Chauhan, Phil Culliton, Luiz Gustavo Martins, Elisa Bandy, David Huntsperger, Glenn Cameron, Arthur Zucker, Tris Warkentin, Ludovic Peran, Minh Giang, Zoubin Ghahramani, Clément Farabet, Koray Kavukcuoglu, Demis Hassabis, Raia Hadsell, Yee Whye Teh, Nando de Freitas

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Feb 29, 2024
Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

Applications of flow models to the generation of correlated lattice QCD ensembles

Jan 19, 2024
Ryan Abbott, Aleksandar Botev, Denis Boyda, Daniel C. Hackett, Gurtej Kanwar, Sébastien Racanière, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

Normalizing flows for lattice gauge theory in arbitrary space-time dimension

May 03, 2023
Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Gurtej Kanwar, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

Feb 20, 2023
Bobby He, James Martens, Guodong Zhang, Aleksandar Botev, Andrew Brock, Samuel L Smith, Yee Whye Teh

Aspects of scaling and scalability for flow-based sampling of lattice QCD

Nov 14, 2022
Ryan Abbott, Michael S. Albergo, Aleksandar Botev, Denis Boyda, Kyle Cranmer, Daniel C. Hackett, Alexander G. D. G. Matthews, Sébastien Racanière, Ali Razavi, Danilo J. Rezende, Fernando Romero-López, Phiala E. Shanahan, Julian M. Urban

Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers

Mar 15, 2022
Guodong Zhang, Aleksandar Botev, James Martens

SyMetric: Measuring the Quality of Learnt Hamiltonian Dynamics Inferred from Vision

Nov 10, 2021
Irina Higgins, Peter Wirnsberger, Andrew Jaegle, Aleksandar Botev

Which priors matter? Benchmarking models for learning latent dynamics

Nov 09, 2021
Aleksandar Botev, Andrew Jaegle, Peter Wirnsberger, Daniel Hennes, Irina Higgins

Better, Faster Fermionic Neural Networks

Nov 13, 2020
James S. Spencer, David Pfau, Aleksandar Botev, W. M. C. Foulkes
