Alert button
Picture for Daniel Y. Fu

Daniel Y. Fu

Alert button

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Add code
Bookmark button
Alert button
Feb 14, 2024
Jon Saad-Falcon, Daniel Y. Fu, Simran Arora, Neel Guha, Christopher Ré

Viaarxiv icon

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Add code
Bookmark button
Alert button
Feb 07, 2024
Jordan Juravsky, Bradley Brown, Ryan Ehrlich, Daniel Y. Fu, Christopher Ré, Azalia Mirhoseini

Viaarxiv icon

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Add code
Bookmark button
Alert button
Nov 10, 2023
Daniel Y. Fu, Hermann Kumbong, Eric Nguyen, Christopher Ré

Viaarxiv icon

Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

Add code
Bookmark button
Alert button
Oct 28, 2023
Stefano Massaroli, Michael Poli, Daniel Y. Fu, Hermann Kumbong, Rom N. Parnichkun, Aman Timalsina, David W. Romero, Quinn McIntyre, Beidi Chen, Atri Rudra, Ce Zhang, Christopher Re, Stefano Ermon, Yoshua Bengio

Figure 1 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 2 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 3 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 4 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Viaarxiv icon

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Add code
Bookmark button
Alert button
Oct 18, 2023
Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré

Viaarxiv icon

High-throughput Generative Inference of Large Language Models with a Single GPU

Add code
Bookmark button
Alert button
Mar 13, 2023
Ying Sheng, Lianmin Zheng, Binhang Yuan, Zhuohan Li, Max Ryabinin, Daniel Y. Fu, Zhiqiang Xie, Beidi Chen, Clark Barrett, Joseph E. Gonzalez, Percy Liang, Christopher Ré, Ion Stoica, Ce Zhang

Figure 1 for High-throughput Generative Inference of Large Language Models with a Single GPU
Figure 2 for High-throughput Generative Inference of Large Language Models with a Single GPU
Figure 3 for High-throughput Generative Inference of Large Language Models with a Single GPU
Figure 4 for High-throughput Generative Inference of Large Language Models with a Single GPU
Viaarxiv icon

Hyena Hierarchy: Towards Larger Convolutional Language Models

Add code
Bookmark button
Alert button
Mar 06, 2023
Michael Poli, Stefano Massaroli, Eric Nguyen, Daniel Y. Fu, Tri Dao, Stephen Baccus, Yoshua Bengio, Stefano Ermon, Christopher Ré

Figure 1 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 2 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 3 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 4 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Viaarxiv icon

Simple Hardware-Efficient Long Convolutions for Sequence Modeling

Add code
Bookmark button
Alert button
Feb 13, 2023
Daniel Y. Fu, Elliot L. Epstein, Eric Nguyen, Armin W. Thomas, Michael Zhang, Tri Dao, Atri Rudra, Christopher Ré

Figure 1 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Figure 2 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Figure 3 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Figure 4 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Viaarxiv icon

Hungry Hungry Hippos: Towards Language Modeling with State Space Models

Add code
Bookmark button
Alert button
Dec 28, 2022
Tri Dao, Daniel Y. Fu, Khaled K. Saab, Armin W. Thomas, Atri Rudra, Christopher Ré

Figure 1 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Figure 2 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Figure 3 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Figure 4 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Viaarxiv icon