Alert button
Picture for Antonio Orvieto

Antonio Orvieto

Alert button

On the low-shot transferability of [V]-Mamba

Add code
Bookmark button
Alert button
Mar 15, 2024
Diganta Misra, Jay Gala, Antonio Orvieto

Figure 1 for On the low-shot transferability of [V]-Mamba
Figure 2 for On the low-shot transferability of [V]-Mamba
Figure 3 for On the low-shot transferability of [V]-Mamba
Figure 4 for On the low-shot transferability of [V]-Mamba
Viaarxiv icon

Theoretical Foundations of Deep Selective State-Space Models

Add code
Bookmark button
Alert button
Mar 04, 2024
Nicola Muca Cirone, Antonio Orvieto, Benjamin Walker, Cristopher Salvi, Terry Lyons

Viaarxiv icon

Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

Add code
Bookmark button
Alert button
Feb 27, 2024
Lorenzo Noci, Alexandru Meterez, Thomas Hofmann, Antonio Orvieto

Viaarxiv icon

SDEs for Minimax Optimization

Add code
Bookmark button
Alert button
Feb 19, 2024
Enea Monzio Compagnoni, Antonio Orvieto, Hans Kersting, Frank Norbert Proske, Aurelien Lucchi

Viaarxiv icon

Recurrent Distance-Encoding Neural Networks for Graph Representation Learning

Add code
Bookmark button
Alert button
Dec 03, 2023
Yuhui Ding, Antonio Orvieto, Bobby He, Thomas Hofmann

Viaarxiv icon

On the Universality of Linear Recurrences Followed by Nonlinear Projections

Add code
Bookmark button
Alert button
Jul 21, 2023
Antonio Orvieto, Soham De, Caglar Gulcehre, Razvan Pascanu, Samuel L. Smith

Figure 1 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Figure 2 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Figure 3 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Figure 4 for On the Universality of Linear Recurrences Followed by Nonlinear Projections
Viaarxiv icon

Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning

Add code
Bookmark button
Alert button
Mar 31, 2023
Sanghwan Kim, Lorenzo Noci, Antonio Orvieto, Thomas Hofmann

Figure 1 for Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
Figure 2 for Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
Figure 3 for Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
Figure 4 for Achieving a Better Stability-Plasticity Trade-off via Auxiliary Networks in Continual Learning
Viaarxiv icon

Resurrecting Recurrent Neural Networks for Long Sequences

Add code
Bookmark button
Alert button
Mar 11, 2023
Antonio Orvieto, Samuel L Smith, Albert Gu, Anushan Fernando, Caglar Gulcehre, Razvan Pascanu, Soham De

Figure 1 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 2 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 3 for Resurrecting Recurrent Neural Networks for Long Sequences
Figure 4 for Resurrecting Recurrent Neural Networks for Long Sequences
Viaarxiv icon