Alert button
Picture for Kurt Keutzer

Kurt Keutzer

Alert button

Magic-Me: Identity-Specific Video Customized Diffusion

Add code
Bookmark button
Alert button
Feb 14, 2024
Ze Ma, Daquan Zhou, Chun-Hsiao Yeh, Xue-She Wang, Xiuyu Li, Huanrui Yang, Zhen Dong, Kurt Keutzer, Jiashi Feng

Viaarxiv icon

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Add code
Bookmark button
Alert button
Feb 07, 2024
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami

Viaarxiv icon

Learned Best-Effort LLM Serving

Add code
Bookmark button
Alert button
Jan 15, 2024
Siddharth Jha, Coleman Hooper, Xiaoxuan Liu, Sehoon Kim, Kurt Keutzer

Viaarxiv icon

VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness

Add code
Bookmark button
Alert button
Jan 15, 2024
Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang

Viaarxiv icon

Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation

Add code
Bookmark button
Alert button
Dec 27, 2023
Rongyu Zhang, Yulin Luo, Jiaming Liu, Huanrui Yang, Zhen Dong, Denis Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Yuan Du, Shanghang Zhang

Viaarxiv icon

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Add code
Bookmark button
Alert button
Dec 19, 2023
Akio Kodaira, Chenfeng Xu, Toshiki Hazama, Takanori Yoshimoto, Kohei Ohno, Shogo Mitsuhori, Soichi Sugano, Hanying Cho, Zhijian Liu, Kurt Keutzer

Viaarxiv icon

Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting

Add code
Bookmark button
Alert button
Dec 14, 2023
Anthony Chen, Huanrui Yang, Yulu Gan, Denis A Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Shanghang Zhang, Kurt Keutzer

Viaarxiv icon

An LLM Compiler for Parallel Function Calling

Add code
Bookmark button
Alert button
Dec 07, 2023
Sehoon Kim, Suhong Moon, Ryan Tabrizi, Nicholas Lee, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

Viaarxiv icon

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration

Add code
Bookmark button
Alert button
Nov 16, 2023
Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See Kiong Ng, Jiashi Feng

Viaarxiv icon