Eugene Belilovsky

MILA

Not Only the Last-Layer Features for Spurious Correlations: All Layer Deep Feature Reweighting

Sep 23, 2024

Accelerating Training with Neuron Interaction and Nowcasting Networks

Sep 06, 2024

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

Jul 07, 2024

Controlling Forgetting with Test-Time Data in Continual Learning

Jun 19, 2024

PETRA: Parallel End-to-end Training with Reversible Architectures

Jun 04, 2024

ACCO: Accumulate while you Communicate, Hiding Communications in Distributed LLM Training

Jun 03, 2024

From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation

Jun 03, 2024

Temporally Consistent Object Editing in Videos using Extended Attention

Jun 01, 2024

μLO: Compute-Efficient Meta-Generalization of Learned Optimizers

May 31, 2024

WASH: Train your Ensemble with Communication-Efficient Weight Shuffling, then Average

May 27, 2024