Picture for Yue Zhu

Yue Zhu

Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference

Add code
May 28, 2025
Viaarxiv icon

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

Add code
May 15, 2025
Viaarxiv icon

FT-Transformer: Resilient and Reliable Transformer with End-to-End Fault Tolerant Attention

Add code
Apr 03, 2025
Viaarxiv icon

Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference

Add code
Mar 11, 2025
Viaarxiv icon

KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification

Add code
Feb 10, 2025
Viaarxiv icon

Towards Pareto Optimal Throughput in Small Language Model Serving

Add code
Apr 04, 2024
Viaarxiv icon

The Case for Universal Basic Computing Power

Add code
Nov 18, 2023
Viaarxiv icon

Do Physicians Know How to Prompt? The Need for Automatic Prompt Optimization Help in Clinical Note Generation

Add code
Nov 16, 2023
Viaarxiv icon

H3WB: Human3.6M 3D WholeBody Dataset and Benchmark

Add code
Nov 28, 2022
Figure 1 for H3WB: Human3.6M 3D WholeBody Dataset and Benchmark
Figure 2 for H3WB: Human3.6M 3D WholeBody Dataset and Benchmark
Figure 3 for H3WB: Human3.6M 3D WholeBody Dataset and Benchmark
Figure 4 for H3WB: Human3.6M 3D WholeBody Dataset and Benchmark
Viaarxiv icon

Decanus to Legatus: Synthetic training for 2D-3D human pose lifting

Add code
Oct 05, 2022
Figure 1 for Decanus to Legatus: Synthetic training for 2D-3D human pose lifting
Figure 2 for Decanus to Legatus: Synthetic training for 2D-3D human pose lifting
Figure 3 for Decanus to Legatus: Synthetic training for 2D-3D human pose lifting
Figure 4 for Decanus to Legatus: Synthetic training for 2D-3D human pose lifting
Viaarxiv icon