Lingchao Zheng

Multi-Scale Dequant: Eliminating Dequantization Bottleneck via Activation Decomposition for Efficient LLM Inference

May 13, 2026
Via arXiv

AIS: Adaptive Importance Sampling for Quantized RL

May 13, 2026
Via arXiv

LoPT: Lossless Parallel Tokenization Acceleration for Long Context Inference of Large Language Model

Nov 07, 2025
Via arXiv