Shixuan Sun

ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive

Aug 26, 2025
Via arXiv

SMA: Who Said That? Auditing Membership Leakage in Semi-Black-box RAG Controlling

Aug 12, 2025
Via arXiv

Efficient Serving of LLM Applications with Probabilistic Demand Modeling

Jun 17, 2025
Via arXiv

Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

Jun 14, 2025
Via arXiv

A Framework to Assess Multilingual Vulnerabilities of LLMs

Mar 17, 2025
Via arXiv

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

Jul 22, 2024
Via arXiv

CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion

Jun 28, 2024
Via arXiv

Efficient Deep Learning Pipelines for Accurate Cost Estimations Over Large Scale Query Workload

Mar 23, 2021
Via arXiv