Picture for Zhenli Zhou

Zhenli Zhou

MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers

Add code
Nov 20, 2024
Figure 1 for MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers
Figure 2 for MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers
Figure 3 for MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers
Figure 4 for MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers
Viaarxiv icon