Alert button
Picture for Felix Dangel

Felix Dangel

Alert button

Lowering PyTorch's Memory Consumption for Selective Differentiation

Add code
Bookmark button
Alert button
Apr 15, 2024
Samarth Bhatia, Felix Dangel

Viaarxiv icon

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Add code
Bookmark button
Alert button
Feb 13, 2024
Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani

Viaarxiv icon

Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC for Large Neural Nets

Add code
Bookmark button
Alert button
Dec 16, 2023
Wu Lin, Felix Dangel, Runa Eschenhagen, Kirill Neklyudov, Agustinus Kristiadi, Richard E. Turner, Alireza Makhzani

Viaarxiv icon

On the Disconnect Between Theory and Practice of Overparametrized Neural Networks

Add code
Bookmark button
Alert button
Sep 29, 2023
Jonathan Wenger, Felix Dangel, Agustinus Kristiadi

Viaarxiv icon

Convolutions Through the Lens of Tensor Networks

Add code
Bookmark button
Alert button
Jul 05, 2023
Felix Dangel

Figure 1 for Convolutions Through the Lens of Tensor Networks
Figure 2 for Convolutions Through the Lens of Tensor Networks
Figure 3 for Convolutions Through the Lens of Tensor Networks
Figure 4 for Convolutions Through the Lens of Tensor Networks
Viaarxiv icon

The Geometry of Neural Nets' Parameter Spaces Under Reparametrization

Add code
Bookmark button
Alert button
Feb 14, 2023
Agustinus Kristiadi, Felix Dangel, Philipp Hennig

Figure 1 for The Geometry of Neural Nets' Parameter Spaces Under Reparametrization
Figure 2 for The Geometry of Neural Nets' Parameter Spaces Under Reparametrization
Figure 3 for The Geometry of Neural Nets' Parameter Spaces Under Reparametrization
Figure 4 for The Geometry of Neural Nets' Parameter Spaces Under Reparametrization
Viaarxiv icon

ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure

Add code
Bookmark button
Alert button
Jun 04, 2021
Felix Dangel, Lukas Tatzel, Philipp Hennig

Figure 1 for ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure
Figure 2 for ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure
Figure 3 for ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure
Viaarxiv icon

Cockpit: A Practical Debugging Tool for Training Deep Neural Networks

Add code
Bookmark button
Alert button
Feb 12, 2021
Frank Schneider, Felix Dangel, Philipp Hennig

Figure 1 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Figure 2 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Figure 3 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Figure 4 for Cockpit: A Practical Debugging Tool for Training Deep Neural Networks
Viaarxiv icon

BackPACK: Packing more into backprop

Add code
Bookmark button
Alert button
Feb 15, 2020
Felix Dangel, Frederik Kunstner, Philipp Hennig

Figure 1 for BackPACK: Packing more into backprop
Figure 2 for BackPACK: Packing more into backprop
Figure 3 for BackPACK: Packing more into backprop
Figure 4 for BackPACK: Packing more into backprop
Viaarxiv icon