Chengxi Liao

DeInfer: Efficient Parallel Inferencing for Decomposed Large Language Models

Apr 20, 2026
Via arXiv