Glenn G. Ko

INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

Jun 13, 2023
Yuji Chai, John Gkountouras, Glenn G. Ko, David Brooks, Gu-Yeon Wei

Via arXiv

Bigger&Faster: Two-stage Neural Architecture Search for Quantized Transformer Models

Sep 25, 2022
Yuji Chai, Luke Bailey, Yunho Jin, Matthew Karle, Glenn G. Ko

Via arXiv