Picture for Daniel Y. Fu

Daniel Y. Fu

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Add code
Feb 14, 2024
Figure 1 for Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
Figure 2 for Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
Figure 3 for Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
Figure 4 for Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT
Viaarxiv icon

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Add code
Feb 07, 2024
Figure 1 for Hydragen: High-Throughput LLM Inference with Shared Prefixes
Figure 2 for Hydragen: High-Throughput LLM Inference with Shared Prefixes
Figure 3 for Hydragen: High-Throughput LLM Inference with Shared Prefixes
Figure 4 for Hydragen: High-Throughput LLM Inference with Shared Prefixes
Viaarxiv icon

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Add code
Nov 10, 2023
Figure 1 for FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Figure 2 for FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Figure 3 for FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Figure 4 for FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Viaarxiv icon

Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions

Add code
Oct 28, 2023
Figure 1 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 2 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 3 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Figure 4 for Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions
Viaarxiv icon

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Add code
Oct 18, 2023
Figure 1 for Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture
Figure 2 for Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture
Figure 3 for Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture
Figure 4 for Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture
Viaarxiv icon

High-throughput Generative Inference of Large Language Models with a Single GPU

Add code
Mar 13, 2023
Figure 1 for High-throughput Generative Inference of Large Language Models with a Single GPU
Figure 2 for High-throughput Generative Inference of Large Language Models with a Single GPU
Figure 3 for High-throughput Generative Inference of Large Language Models with a Single GPU
Figure 4 for High-throughput Generative Inference of Large Language Models with a Single GPU
Viaarxiv icon

Hyena Hierarchy: Towards Larger Convolutional Language Models

Add code
Mar 06, 2023
Figure 1 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 2 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 3 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Figure 4 for Hyena Hierarchy: Towards Larger Convolutional Language Models
Viaarxiv icon

Simple Hardware-Efficient Long Convolutions for Sequence Modeling

Add code
Feb 13, 2023
Figure 1 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Figure 2 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Figure 3 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Figure 4 for Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Viaarxiv icon

Hungry Hungry Hippos: Towards Language Modeling with State Space Models

Add code
Dec 28, 2022
Figure 1 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Figure 2 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Figure 3 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Figure 4 for Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Viaarxiv icon

Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models

Add code
Jun 10, 2022
Figure 1 for Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Figure 2 for Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Figure 3 for Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Figure 4 for Lost in Transmission: On the Impact of Networking Corruptions on Video Machine Learning Models
Viaarxiv icon