Andrey Zhmoginov

Continual Few-Shot Learning Using HyperTransformers

Jan 12, 2023
Max Vladymyrov, Andrey Zhmoginov, Mark Sandler

Via arXiv

Training trajectories, mini-batch losses and the curious role of the learning rate

Jan 05, 2023
Mark Sandler, Andrey Zhmoginov, Max Vladymyrov, Nolan Miller

Via arXiv

Transformers learn in-context by gradient descent

Dec 15, 2022
Johannes von Oswald, Eyvind Niklasson, Ettore Randazzo, João Sacramento, Alexander Mordvintsev, Andrey Zhmoginov, Max Vladymyrov

Via arXiv

Decentralized Learning with Multi-Headed Distillation

Nov 28, 2022
Andrey Zhmoginov, Mark Sandler, Nolan Miller, Gus Kristiansen, Max Vladymyrov

Via arXiv

Fine-tuning Image Transformers using Learnable Memory

Mar 30, 2022
Mark Sandler, Andrey Zhmoginov, Max Vladymyrov, Andrew Jackson

Via arXiv

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning

Jan 15, 2022
Andrey Zhmoginov, Mark Sandler, Max Vladymyrov

Via arXiv

Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks

Jul 23, 2021
Andrey Zhmoginov, Dina Bashkirova, Mark Sandler

Via arXiv

BasisNet: Two-stage Model Synthesis for Efficient Inference

May 07, 2021
Mingda Zhang, Chun-Te Chu, Andrey Zhmoginov, Andrew Howard, Brendan Jou, Yukun Zhu, Li Zhang, Rebecca Hwa, Adriana Kovashka

Via arXiv

Meta-Learning Bidirectional Update Rules

Apr 10, 2021
Mark Sandler, Max Vladymyrov, Andrey Zhmoginov, Nolan Miller, Andrew Jackson, Tom Madams, Blaise Aguera y Arcas

Via arXiv