Haotong Qin

ARB-LLM: Alternating Refined Binarizations for Large Language Models

Oct 04, 2024

A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

Sep 25, 2024

2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution

Jun 10, 2024

Binarized Diffusion Model for Image Super-Resolution

Jun 09, 2024

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models

May 23, 2024

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Apr 22, 2024

BinaryDM: Towards Accurate Binarization of Diffusion Model

Apr 08, 2024

Graph Construction with Flexible Nodes for Traffic Demand Prediction

Mar 01, 2024

DB-LLM: Accurate Dual-Binarization for Efficient LLMs

Feb 19, 2024

Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

Feb 08, 2024