Picture for Danyang Zhuo

Danyang Zhuo

Duke University

Parallel Prefix Verification for Speculative Generation

Add code
May 05, 2026
Viaarxiv icon

Foundry: Template-Based CUDA Graph Context Materialization for Fast LLM Serving Cold Start

Add code
Apr 08, 2026
Viaarxiv icon

Real-Time and Scalable Zak-OTFS Receiver Processing on GPUs

Add code
Apr 02, 2026
Viaarxiv icon

InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context

Add code
Mar 05, 2026
Viaarxiv icon

Agentic AI for Scalable and Robust Optical Systems Control

Add code
Feb 23, 2026
Viaarxiv icon

Curator: Efficient Vector Search with Low-Selectivity Filters

Add code
Jan 07, 2026
Viaarxiv icon

Phantora: Live GPU Cluster Simulation for Machine Learning System Performance Estimation

Add code
May 02, 2025
Viaarxiv icon

HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs

Add code
Apr 04, 2025
Viaarxiv icon

Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement

Add code
Jul 05, 2024
Figure 1 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 2 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 3 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Figure 4 for Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Viaarxiv icon

VcLLM: Video Codecs are Secretly Tensor Codecs

Add code
Jun 29, 2024
Figure 1 for VcLLM: Video Codecs are Secretly Tensor Codecs
Figure 2 for VcLLM: Video Codecs are Secretly Tensor Codecs
Figure 3 for VcLLM: Video Codecs are Secretly Tensor Codecs
Figure 4 for VcLLM: Video Codecs are Secretly Tensor Codecs
Viaarxiv icon