Alert button
Picture for Kan Zhu

Kan Zhu

Alert button

Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models

Add code
Bookmark button
Alert button
Feb 10, 2024
Keisuke Kamahori, Yile Gu, Kan Zhu, Baris Kasikci

Viaarxiv icon

Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Add code
Bookmark button
Alert button
Nov 07, 2023
Yilong Zhao, Chien-Yu Lin, Kan Zhu, Zihao Ye, Lequn Chen, Size Zheng, Luis Ceze, Arvind Krishnamurthy, Tianqi Chen, Baris Kasikci

Figure 1 for Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Figure 2 for Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Figure 3 for Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Figure 4 for Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Viaarxiv icon

Practical Algorithms for Learning Near-Isometric Linear Embeddings

Add code
Bookmark button
Alert button
Apr 22, 2016
Jerry Luo, Kayla Shapiro, Hao-Jun Michael Shi, Qi Yang, Kan Zhu

Figure 1 for Practical Algorithms for Learning Near-Isometric Linear Embeddings
Figure 2 for Practical Algorithms for Learning Near-Isometric Linear Embeddings
Figure 3 for Practical Algorithms for Learning Near-Isometric Linear Embeddings
Figure 4 for Practical Algorithms for Learning Near-Isometric Linear Embeddings
Viaarxiv icon