Picture for Zhiteng Li

Zhiteng Li

ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration

Add code
May 30, 2025
Viaarxiv icon

DVD-Quant: Data-free Video Diffusion Transformers Quantization

Add code
May 24, 2025
Viaarxiv icon

Low-bit Model Quantization for Deep Neural Networks: A Survey

Add code
May 08, 2025
Viaarxiv icon

QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation

Add code
Mar 09, 2025
Viaarxiv icon

CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution

Add code
Feb 21, 2025
Viaarxiv icon

AdaSVD: Adaptive Singular Value Decomposition for Large Language Models

Add code
Feb 04, 2025
Figure 1 for AdaSVD: Adaptive Singular Value Decomposition for Large Language Models
Figure 2 for AdaSVD: Adaptive Singular Value Decomposition for Large Language Models
Figure 3 for AdaSVD: Adaptive Singular Value Decomposition for Large Language Models
Figure 4 for AdaSVD: Adaptive Singular Value Decomposition for Large Language Models
Viaarxiv icon

Progressive Binarization with Semi-Structured Pruning for LLMs

Add code
Feb 03, 2025
Viaarxiv icon

ARB-LLM: Alternating Refined Binarizations for Large Language Models

Add code
Oct 04, 2024
Figure 1 for ARB-LLM: Alternating Refined Binarizations for Large Language Models
Figure 2 for ARB-LLM: Alternating Refined Binarizations for Large Language Models
Figure 3 for ARB-LLM: Alternating Refined Binarizations for Large Language Models
Figure 4 for ARB-LLM: Alternating Refined Binarizations for Large Language Models
Viaarxiv icon

Binarized 3D Whole-body Human Mesh Recovery

Add code
Nov 24, 2023
Viaarxiv icon