Simon Lacoste-Julien

DIRO, MILA

On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization

Jun 07, 2024

Nonparametric Partial Disentanglement via Mechanism Sparsity: Sparse Actions, Interventions and Sparse Temporal Dependencies

Jan 10, 2024

Weight-Sharing Regularization

Nov 06, 2023

Balancing Act: Constraining Disparate Impact in Sparse Models

Oct 31, 2023

Promoting Exploration in Memory-Augmented Adam using Critical Momenta

Jul 18, 2023

Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation

Jul 05, 2023

Identifiability of Discretized Latent Coordinate Systems via Density Landmarks Detection

Jun 28, 2023

PopulAtion Parameter Averaging (PAPA)

Apr 06, 2023

Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?

Mar 07, 2023

Unlocking Slot Attention by Changing Optimal Transport Costs

Jan 30, 2023