Kyoungmin Kim

Fast LLM-Based Semantic Filtering: From a Unified Framework to an Adaptive Two-Phase Method

Jun 06, 2026
Via arXiv

Trustworthy and Efficient LLMs Meet Databases

Dec 23, 2024
Via arXiv

The Effect of Scheduling and Preemption on the Efficiency of LLM Inference Serving

Nov 12, 2024
Via arXiv