Picture for Dimitris Papailiopoulos

Dimitris Papailiopoulos

The Expressive Power of Tuning Only the Norm Layers

Add code
Feb 15, 2023
Viaarxiv icon

Looped Transformers as Programmable Computers

Add code
Jan 30, 2023
Figure 1 for Looped Transformers as Programmable Computers
Figure 2 for Looped Transformers as Programmable Computers
Figure 3 for Looped Transformers as Programmable Computers
Figure 4 for Looped Transformers as Programmable Computers
Viaarxiv icon

Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning

Add code
Jan 17, 2023
Figure 1 for Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning
Figure 2 for Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning
Figure 3 for Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning
Figure 4 for Transformers as Algorithms: Generalization and Implicit Model Selection in In-context Learning
Viaarxiv icon

A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets

Add code
Oct 06, 2022
Figure 1 for A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets
Figure 2 for A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets
Figure 3 for A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets
Figure 4 for A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets
Viaarxiv icon

LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks

Add code
Jun 15, 2022
Figure 1 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Figure 2 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Figure 3 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Figure 4 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Viaarxiv icon

Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment

Add code
May 23, 2022
Figure 1 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 2 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 3 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 4 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Viaarxiv icon

Rare Gems: Finding Lottery Tickets at Initialization

Add code
Feb 24, 2022
Figure 1 for Rare Gems: Finding Lottery Tickets at Initialization
Figure 2 for Rare Gems: Finding Lottery Tickets at Initialization
Figure 3 for Rare Gems: Finding Lottery Tickets at Initialization
Figure 4 for Rare Gems: Finding Lottery Tickets at Initialization
Viaarxiv icon

GenLabel: Mixup Relabeling using Generative Models

Add code
Jan 07, 2022
Figure 1 for GenLabel: Mixup Relabeling using Generative Models
Figure 2 for GenLabel: Mixup Relabeling using Generative Models
Figure 3 for GenLabel: Mixup Relabeling using Generative Models
Figure 4 for GenLabel: Mixup Relabeling using Generative Models
Viaarxiv icon

Finding Everything within Random Binary Networks

Add code
Oct 22, 2021
Figure 1 for Finding Everything within Random Binary Networks
Figure 2 for Finding Everything within Random Binary Networks
Figure 3 for Finding Everything within Random Binary Networks
Figure 4 for Finding Everything within Random Binary Networks
Viaarxiv icon

An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks

Add code
Jun 14, 2021
Figure 1 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Figure 2 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Figure 3 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Figure 4 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Viaarxiv icon