Kurt Keutzer

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning

Apr 13, 2024
Yijiang Liu, Rongyu Zhang, Huanrui Yang, Kurt Keutzer, Yuan Du, Li Du, Shanghang Zhang

Via arXiv

LLoCO: Learning Long Contexts Offline

Apr 11, 2024
Sijun Tan, Xiuyu Li, Shishir Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Ada Popa

Via arXiv

RouterBench: A Benchmark for Multi-LLM Routing System

Mar 28, 2024
Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay

Via arXiv

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Mar 22, 2024
Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

Via arXiv

AI and Memory Wall

Mar 21, 2024
Amir Gholami, Zhewei Yao, Sehoon Kim, Coleman Hooper, Michael W. Mahoney, Kurt Keutzer

Via arXiv

Q-SLAM: Quadric Representations for Monocular SLAM

Mar 12, 2024
Chensheng Peng, Chenfeng Xu, Yue Wang, Mingyu Ding, Heng Yang, Masayoshi Tomizuka, Kurt Keutzer, Marco Pavone, Wei Zhan

Via arXiv

LLM Inference Unveiled: Survey and Roofline Model Insights

Mar 11, 2024
Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen Dong, Zhe Zhou, Chenhao Xue, Bingzhe Wu, Zhikai Li, Qingyi Gu, Yong Jae Lee, Yan Yan, Beidi Chen, Guangyu Sun, Kurt Keutzer

Via arXiv

Magic-Me: Identity-Specific Video Customized Diffusion

Feb 14, 2024
Ze Ma, Daquan Zhou, Chun-Hsiao Yeh, Xue-She Wang, Xiuyu Li, Huanrui Yang, Zhen Dong, Kurt Keutzer, Jiashi Feng

Via arXiv

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Feb 07, 2024
Coleman Hooper, Sehoon Kim, Hiva Mohammadzadeh, Michael W. Mahoney, Yakun Sophia Shao, Kurt Keutzer, Amir Gholami

Via arXiv

VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness

Jan 15, 2024
Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang

Via arXiv