Alert button
Picture for Bobby He

Bobby He

Alert button

Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends

Add code
Bookmark button
Alert button
Mar 12, 2024
Sidak Pal Singh, Bobby He, Thomas Hofmann, Bernhard Schölkopf

Figure 1 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 2 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 3 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Figure 4 for Hallmarks of Optimization Trajectories in Neural Networks and LLMs: The Lengths, Bends, and Dead Ends
Viaarxiv icon

Recurrent Distance-Encoding Neural Networks for Graph Representation Learning

Add code
Bookmark button
Alert button
Dec 03, 2023
Yuhui Ding, Antonio Orvieto, Bobby He, Thomas Hofmann

Viaarxiv icon

Simplifying Transformer Blocks

Add code
Bookmark button
Alert button
Nov 03, 2023
Bobby He, Thomas Hofmann

Viaarxiv icon

The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit

Add code
Bookmark button
Alert button
Jun 30, 2023
Lorenzo Noci, Chuning Li, Mufan Bill Li, Bobby He, Thomas Hofmann, Chris Maddison, Daniel M. Roy

Figure 1 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 2 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 3 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Figure 4 for The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
Viaarxiv icon

Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation

Add code
Bookmark button
Alert button
Feb 20, 2023
Bobby He, James Martens, Guodong Zhang, Aleksandar Botev, Andrew Brock, Samuel L Smith, Yee Whye Teh

Figure 1 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 2 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 3 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Figure 4 for Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Viaarxiv icon

UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography

Add code
Bookmark button
Alert button
Feb 22, 2022
Francisca Vasconcelos, Bobby He, Nalini Singh, Yee Whye Teh

Figure 1 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Figure 2 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Figure 3 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Figure 4 for UncertaINR: Uncertainty Quantification of End-to-End Implicit Neural Representations for Computed Tomography
Viaarxiv icon

Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning

Add code
Bookmark button
Alert button
Oct 22, 2021
Soufiane Hayou, Bobby He, Gintare Karolina Dziugaite

Figure 1 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Figure 2 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Figure 3 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Figure 4 for Probabilistic fine-tuning of pruning masks and PAC-Bayes self-bounded learning
Viaarxiv icon

Stable ResNet

Add code
Bookmark button
Alert button
Oct 24, 2020
Soufiane Hayou, Eugenio Clerico, Bobby He, George Deligiannidis, Arnaud Doucet, Judith Rousseau

Figure 1 for Stable ResNet
Figure 2 for Stable ResNet
Figure 3 for Stable ResNet
Figure 4 for Stable ResNet
Viaarxiv icon

Bayesian Deep Ensembles via the Neural Tangent Kernel

Add code
Bookmark button
Alert button
Jul 11, 2020
Bobby He, Balaji Lakshminarayanan, Yee Whye Teh

Figure 1 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Figure 2 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Figure 3 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Figure 4 for Bayesian Deep Ensembles via the Neural Tangent Kernel
Viaarxiv icon