Picture for Junjia Du

Junjia Du

Nexus: Taming Throughput-Latency Tradeoff in LLM Serving via Efficient GPU Sharing

Add code
Jul 09, 2025
Viaarxiv icon

Kongzi: A Historical Large Language Model with Fact Enhancement

Add code
Apr 13, 2025
Viaarxiv icon

Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models

Add code
Apr 10, 2025
Viaarxiv icon

DependEval: Benchmarking LLMs for Repository Dependency Understanding

Add code
Mar 09, 2025
Viaarxiv icon