Picture for Samy Jelassi

Samy Jelassi

DMA, CIMS

LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

Add code
Oct 16, 2024
Figure 1 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Figure 2 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Figure 3 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Figure 4 for LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
Viaarxiv icon

Universal Length Generalization with Turing Programs

Add code
Jul 03, 2024
Viaarxiv icon

How Does Overparameterization Affect Features?

Add code
Jul 01, 2024
Viaarxiv icon

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Add code
Feb 22, 2024
Viaarxiv icon

Repeat After Me: Transformers are Better than State Space Models at Copying

Add code
Feb 01, 2024
Viaarxiv icon

Length Generalization in Arithmetic Transformers

Add code
Jun 27, 2023
Viaarxiv icon

Depth Dependence of $μ$P Learning Rates in ReLU MLPs

Add code
May 13, 2023
Viaarxiv icon

Vision Transformers provably learn spatial structure

Add code
Oct 13, 2022
Figure 1 for Vision Transformers provably learn spatial structure
Figure 2 for Vision Transformers provably learn spatial structure
Figure 3 for Vision Transformers provably learn spatial structure
Figure 4 for Vision Transformers provably learn spatial structure
Viaarxiv icon

Dissecting adaptive methods in GANs

Add code
Oct 09, 2022
Figure 1 for Dissecting adaptive methods in GANs
Figure 2 for Dissecting adaptive methods in GANs
Figure 3 for Dissecting adaptive methods in GANs
Figure 4 for Dissecting adaptive methods in GANs
Viaarxiv icon

Towards understanding how momentum improves generalization in deep learning

Add code
Jul 13, 2022
Figure 1 for Towards understanding how momentum improves generalization in deep learning
Figure 2 for Towards understanding how momentum improves generalization in deep learning
Figure 3 for Towards understanding how momentum improves generalization in deep learning
Figure 4 for Towards understanding how momentum improves generalization in deep learning
Viaarxiv icon