Alert button
Picture for Daiyaan Arfeen

Daiyaan Arfeen

Alert button

SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification

Add code
Bookmark button
Alert button
May 16, 2023
Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Rae Ying Yee Wong, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia

Figure 1 for SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification
Figure 2 for SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification
Figure 3 for SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification
Figure 4 for SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification
Viaarxiv icon

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks

Add code
Bookmark button
Alert button
Nov 10, 2019
Zhen Dong, Zhewei Yao, Yaohui Cai, Daiyaan Arfeen, Amir Gholami, Michael W. Mahoney, Kurt Keutzer

Figure 1 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Figure 2 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Figure 3 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Figure 4 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Viaarxiv icon

Unsupervised Projection Networks for Generative Adversarial Networks

Add code
Bookmark button
Alert button
Oct 06, 2019
Daiyaan Arfeen, Jesse Zhang

Figure 1 for Unsupervised Projection Networks for Generative Adversarial Networks
Figure 2 for Unsupervised Projection Networks for Generative Adversarial Networks
Figure 3 for Unsupervised Projection Networks for Generative Adversarial Networks
Figure 4 for Unsupervised Projection Networks for Generative Adversarial Networks
Viaarxiv icon