Alert button
Picture for Daniel M. Roy

Daniel M. Roy

Alert button

Simultaneous linear connectivity of neural networks modulo permutation

Add code
Bookmark button
Alert button
Apr 09, 2024
Ekansh Sharma, Devin Kwok, Tom Denton, Daniel M. Roy, David Rolnick, Gintare Karolina Dziugaite

Viaarxiv icon

Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization

Add code
Bookmark button
Alert button
Feb 14, 2024
Idan Attias, Gintare Karolina Dziugaite, Mahdi Haghifam, Roi Livni, Daniel M. Roy

Viaarxiv icon

The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit

Add code
Bookmark button
Alert button
Jun 30, 2023
Lorenzo Noci, Chuning Li, Mufan Bill Li, Bobby He, Thomas Hofmann, Chris Maddison, Daniel M. Roy

Figure 1 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 2 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 3 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 4 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Viaarxiv icon

Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization

Add code
Bookmark button
Alert button
Dec 27, 2022
Mahdi Haghifam, Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund, Daniel M. Roy, Gintare Karolina Dziugaite

Figure 1 for Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization
Figure 2 for Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization
Viaarxiv icon

Pruning's Effect on Generalization Through the Lens of Training and Regularization

Add code
Bookmark button
Alert button
Oct 25, 2022
Tian Jin, Michael Carbin, Daniel M. Roy, Jonathan Frankle, Gintare Karolina Dziugaite

Figure 1 for Pruning's Effect on Generalization Through the Lens of Training and Regularization
Figure 2 for Pruning's Effect on Generalization Through the Lens of Training and Regularization
Figure 3 for Pruning's Effect on Generalization Through the Lens of Training and Regularization
Figure 4 for Pruning's Effect on Generalization Through the Lens of Training and Regularization
Viaarxiv icon

Statistical Inference with Stochastic Gradient Algorithms

Add code
Bookmark button
Alert button
Jul 25, 2022
Jeffrey Negrea, Jun Yang, Haoyue Feng, Daniel M. Roy, Jonathan H. Huggins

Figure 1 for Statistical Inference with Stochastic Gradient Algorithms
Figure 2 for Statistical Inference with Stochastic Gradient Algorithms
Figure 3 for Statistical Inference with Stochastic Gradient Algorithms
Figure 4 for Statistical Inference with Stochastic Gradient Algorithms
Viaarxiv icon

Understanding Generalization via Leave-One-Out Conditional Mutual Information

Add code
Bookmark button
Alert button
Jun 29, 2022
Mahdi Haghifam, Shay Moran, Daniel M. Roy, Gintare Karolina Dziugaite

Figure 1 for Understanding Generalization via Leave-One-Out Conditional Mutual Information
Figure 2 for Understanding Generalization via Leave-One-Out Conditional Mutual Information
Viaarxiv icon

The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization

Add code
Bookmark button
Alert button
Jun 06, 2022
Mufan Bill Li, Mihai Nica, Daniel M. Roy

Figure 1 for The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization
Figure 2 for The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization
Figure 3 for The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization
Figure 4 for The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization
Viaarxiv icon

Adaptively Exploiting d-Separators with Causal Bandits

Add code
Bookmark button
Alert button
Feb 10, 2022
Blair Bilodeau, Linbo Wang, Daniel M. Roy

Figure 1 for Adaptively Exploiting d-Separators with Causal Bandits
Figure 2 for Adaptively Exploiting d-Separators with Causal Bandits
Figure 3 for Adaptively Exploiting d-Separators with Causal Bandits
Viaarxiv icon

Towards a Unified Information-Theoretic Framework for Generalization

Add code
Bookmark button
Alert button
Nov 17, 2021
Mahdi Haghifam, Gintare Karolina Dziugaite, Shay Moran, Daniel M. Roy

Figure 1 for Towards a Unified Information-Theoretic Framework for Generalization
Viaarxiv icon