Picture for Frank Schneider

Frank Schneider

Mitigating Forgetting in Low Rank Adaptation

Add code
Dec 19, 2025
Viaarxiv icon

Connecting Parameter Magnitudes and Hessian Eigenspaces at Scale using Sketched Methods

Add code
Apr 20, 2025
Viaarxiv icon

Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition

Add code
Feb 20, 2025
Figure 1 for Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition
Figure 2 for Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition
Figure 3 for Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition
Figure 4 for Accelerating Neural Network Training: An Analysis of the AlgoPerf Competition
Viaarxiv icon

Efficient Weight-Space Laplace-Gaussian Filtering and Smoothing for Sequential Deep Learning

Add code
Oct 09, 2024
Figure 1 for Efficient Weight-Space Laplace-Gaussian Filtering and Smoothing for Sequential Deep Learning
Figure 2 for Efficient Weight-Space Laplace-Gaussian Filtering and Smoothing for Sequential Deep Learning
Figure 3 for Efficient Weight-Space Laplace-Gaussian Filtering and Smoothing for Sequential Deep Learning
Figure 4 for Efficient Weight-Space Laplace-Gaussian Filtering and Smoothing for Sequential Deep Learning
Viaarxiv icon

Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures

Add code
Nov 01, 2023
Figure 1 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Figure 2 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Figure 3 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Figure 4 for Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures
Viaarxiv icon

Accelerating Generalized Linear Models by Trading off Computation for Uncertainty

Add code
Oct 31, 2023
Figure 1 for Accelerating Generalized Linear Models by Trading off Computation for Uncertainty
Figure 2 for Accelerating Generalized Linear Models by Trading off Computation for Uncertainty
Figure 3 for Accelerating Generalized Linear Models by Trading off Computation for Uncertainty
Figure 4 for Accelerating Generalized Linear Models by Trading off Computation for Uncertainty
Viaarxiv icon

Benchmarking Neural Network Training Algorithms

Add code
Jun 12, 2023
Figure 1 for Benchmarking Neural Network Training Algorithms
Figure 2 for Benchmarking Neural Network Training Algorithms
Figure 3 for Benchmarking Neural Network Training Algorithms
Figure 4 for Benchmarking Neural Network Training Algorithms
Viaarxiv icon

Cockpit: A Practical Debugging Tool for Training Deep Neural Networks

Add code
Feb 12, 2021
Figure 1 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Figure 2 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Figure 3 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Figure 4 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Viaarxiv icon

Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers

Add code
Jul 07, 2020
Figure 1 for Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Figure 2 for Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Figure 3 for Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Figure 4 for Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Viaarxiv icon

DeepOBS: A Deep Learning Optimizer Benchmark Suite

Add code
Mar 13, 2019
Figure 1 for DeepOBS: A Deep Learning Optimizer Benchmark Suite
Figure 2 for DeepOBS: A Deep Learning Optimizer Benchmark Suite
Figure 3 for DeepOBS: A Deep Learning Optimizer Benchmark Suite
Figure 4 for DeepOBS: A Deep Learning Optimizer Benchmark Suite
Viaarxiv icon