Alert button
Picture for Yuji Chai

Yuji Chai

Alert button

INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

Add code
Bookmark button
Alert button
Jun 13, 2023
Yuji Chai, John Gkountouras, Glenn G. Ko, David Brooks, Gu-Yeon Wei

Figure 1 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Figure 2 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Figure 3 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Figure 4 for INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation
Viaarxiv icon

PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices

Add code
Bookmark button
Alert button
Jan 26, 2023
Yuji Chai, Devashree Tripathy, Chuteng Zhou, Dibakar Gope, Igor Fedorov, Ramon Matas, David Brooks, Gu-Yeon Wei, Paul Whatmough

Figure 1 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Figure 2 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Figure 3 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Figure 4 for PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Viaarxiv icon

Bigger&Faster: Two-stage Neural Architecture Search for Quantized Transformer Models

Add code
Bookmark button
Alert button
Sep 25, 2022
Yuji Chai, Luke Bailey, Yunho Jin, Matthew Karle, Glenn G. Ko

Figure 1 for Bigger&Faster: Two-stage Neural Architecture Search for Quantized Transformer Models
Figure 2 for Bigger&Faster: Two-stage Neural Architecture Search for Quantized Transformer Models
Figure 3 for Bigger&Faster: Two-stage Neural Architecture Search for Quantized Transformer Models
Figure 4 for Bigger&Faster: Two-stage Neural Architecture Search for Quantized Transformer Models
Viaarxiv icon