Picture for Markus Hoehnerbach

Markus Hoehnerbach

FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling

Add code
Mar 05, 2026
Viaarxiv icon

Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light

Add code
Apr 23, 2025
Viaarxiv icon

Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE

Add code
Apr 14, 2025
Figure 1 for Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE
Figure 2 for Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE
Figure 3 for Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE
Figure 4 for Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE
Viaarxiv icon