Alert button
Picture for Dmitry Belenko

Dmitry Belenko

Alert button

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Add code
Bookmark button
Alert button
Dec 12, 2023
Keivan Alizadeh, Iman Mirzadeh, Dmitry Belenko, Karen Khatamifard, Minsik Cho, Carlo C Del Mundo, Mohammad Rastegari, Mehrdad Farajtabar

Viaarxiv icon