Alert button
Picture for Carlo Luschi

Carlo Luschi

Alert button

Reducing the Cost of Quantum Chemical Data By Backpropagating Through Density Functional Theory

Add code
Bookmark button
Alert button
Feb 06, 2024
Alexander Mathiasen, Hatem Helal, Paul Balanca, Adam Krzywaniak, Ali Parviz, Frederik Hvilshøj, Blazej Banaszewski, Carlo Luschi, Andrew William Fitzgibbon

Viaarxiv icon

SparQ Attention: Bandwidth-Efficient LLM Inference

Add code
Bookmark button
Alert button
Dec 08, 2023
Luka Ribar, Ivan Chelombiev, Luke Hudlass-Galley, Charlie Blake, Carlo Luschi, Douglas Orr

Viaarxiv icon

Generating QM1B with PySCF$_{\text{IPU}}$

Add code
Bookmark button
Alert button
Nov 02, 2023
Alexander Mathiasen, Hatem Helal, Kerstin Klaser, Paul Balanca, Josef Dean, Carlo Luschi, Dominique Beaini, Andrew Fitzgibbon, Dominic Masters

Viaarxiv icon

Training and inference of large language models using 8-bit floating point

Add code
Bookmark button
Alert button
Sep 29, 2023
Sergio P. Perez, Yan Zhang, James Briggs, Charlie Blake, Josh Levy-Kramer, Paul Balanca, Carlo Luschi, Stephen Barlow, Andrew William Fitzgibbon

Viaarxiv icon

Unit Scaling: Out-of-the-Box Low-Precision Training

Add code
Bookmark button
Alert button
Mar 20, 2023
Charlie Blake, Douglas Orr, Carlo Luschi

Figure 1 for Unit Scaling: Out-of-the-Box Low-Precision Training
Figure 2 for Unit Scaling: Out-of-the-Box Low-Precision Training
Figure 3 for Unit Scaling: Out-of-the-Box Low-Precision Training
Figure 4 for Unit Scaling: Out-of-the-Box Low-Precision Training
Viaarxiv icon

BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion

Add code
Bookmark button
Alert button
Nov 22, 2022
Alberto Cattaneo, Daniel Justus, Harry Mellor, Douglas Orr, Jerome Maloberti, Zhenying Liu, Thorin Farnsworth, Andrew Fitzgibbon, Blazej Banaszewski, Carlo Luschi

Figure 1 for BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion
Figure 2 for BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion
Figure 3 for BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion
Figure 4 for BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion
Viaarxiv icon

8-bit Numerical Formats for Deep Neural Networks

Add code
Bookmark button
Alert button
Jun 06, 2022
Badreddine Noune, Philip Jones, Daniel Justus, Dominic Masters, Carlo Luschi

Figure 1 for 8-bit Numerical Formats for Deep Neural Networks
Figure 2 for 8-bit Numerical Formats for Deep Neural Networks
Figure 3 for 8-bit Numerical Formats for Deep Neural Networks
Figure 4 for 8-bit Numerical Formats for Deep Neural Networks
Viaarxiv icon

Towards Structured Dynamic Sparse Pre-Training of BERT

Add code
Bookmark button
Alert button
Aug 13, 2021
Anastasia Dietrich, Frithjof Gressmann, Douglas Orr, Ivan Chelombiev, Daniel Justus, Carlo Luschi

Figure 1 for Towards Structured Dynamic Sparse Pre-Training of BERT
Figure 2 for Towards Structured Dynamic Sparse Pre-Training of BERT
Figure 3 for Towards Structured Dynamic Sparse Pre-Training of BERT
Figure 4 for Towards Structured Dynamic Sparse Pre-Training of BERT
Viaarxiv icon

Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training

Add code
Bookmark button
Alert button
Jun 15, 2021
Dominic Masters, Antoine Labatie, Zach Eaton-Rosen, Carlo Luschi

Figure 1 for Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training
Figure 2 for Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training
Figure 3 for Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training
Figure 4 for Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training
Viaarxiv icon