Picture for Sheng Di

Sheng Di

PackKV: Reducing KV Cache Memory Footprint through LLM-Aware Lossy Compression

Add code
Dec 30, 2025
Viaarxiv icon

DeepCQ: General-Purpose Deep-Surrogate Framework for Lossy Compression Quality Prediction

Add code
Dec 24, 2025
Viaarxiv icon

An Efficient Gradient-Aware Error-Bounded Lossy Compressor for Federated Learning

Add code
Nov 07, 2025
Viaarxiv icon

MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?

Add code
Sep 09, 2025
Viaarxiv icon

Systematic Evaluation of Optimization Techniques for Long-Context Language Models

Add code
Aug 01, 2025
Viaarxiv icon

CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning

Add code
Feb 26, 2025
Viaarxiv icon

FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance

Add code
Aug 02, 2024
Figure 1 for FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance
Figure 2 for FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance
Figure 3 for FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance
Figure 4 for FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance
Viaarxiv icon

FedFa: A Fully Asynchronous Training Paradigm for Federated Learning

Add code
Apr 17, 2024
Figure 1 for FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
Figure 2 for FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
Figure 3 for FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
Figure 4 for FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
Viaarxiv icon

Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets

Add code
Mar 23, 2024
Figure 1 for Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets
Figure 2 for Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets
Figure 3 for Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets
Figure 4 for Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets
Viaarxiv icon

SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks

Add code
Sep 07, 2023
Figure 1 for SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks
Figure 2 for SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks
Figure 3 for SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks
Figure 4 for SRN-SZ: Deep Leaning-Based Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks
Viaarxiv icon