Hongli Xu

SABlock: Semantic-Aware KV Cache Eviction with Adaptive Compression Block Size

Oct 26, 2025
Via arXiv

Enabling Reconfiguration-Communication Overlap for Collective Communication in Optical Networks

Oct 22, 2025
Via arXiv

Accelerating Mixture-of-Expert Inference with Adaptive Expert Split Mechanism

Sep 10, 2025
Via arXiv

Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning

Jun 06, 2025
Via arXiv

PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation

Apr 06, 2025
Via arXiv

Resource-Efficient Federated Fine-Tuning Large Language Models for Heterogeneous Data

Mar 27, 2025
Via arXiv

Collaborative Speculative Inference for Efficient LLM Inference Serving

Mar 13, 2025
Via arXiv

Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout

Mar 13, 2025
Via arXiv

GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation

Feb 06, 2025
Via arXiv

Lightweight and Post-Training Structured Pruning for On-Device Large Language Models

Jan 25, 2025
Via arXiv