Alert button
Picture for Kurt Keutzer

Kurt Keutzer

Alert button

Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning

Add code
Bookmark button
Alert button
Apr 13, 2024
Yijiang Liu, Rongyu Zhang, Huanrui Yang, Kurt Keutzer, Yuan Du, Li Du, Shanghang Zhang

Viaarxiv icon

LLoCO: Learning Long Contexts Offline

Add code
Bookmark button
Alert button
Apr 11, 2024
Sijun Tan, Xiuyu Li, Shishir Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Ada Popa

Viaarxiv icon

RouterBench: A Benchmark for Multi-LLM Routing System

Add code
Bookmark button
Alert button
Mar 28, 2024
Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay

Figure 1 for RouterBench: A Benchmark for Multi-LLM Routing System
Figure 2 for RouterBench: A Benchmark for Multi-LLM Routing System
Figure 3 for RouterBench: A Benchmark for Multi-LLM Routing System
Figure 4 for RouterBench: A Benchmark for Multi-LLM Routing System
Viaarxiv icon

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Add code
Bookmark button
Alert button
Mar 22, 2024
Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipali, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

Viaarxiv icon

AI and Memory Wall

Add code
Bookmark button
Alert button
Mar 21, 2024
Amir Gholami, Zhewei Yao, Sehoon Kim, Coleman Hooper, Michael W. Mahoney, Kurt Keutzer

Figure 1 for AI and Memory Wall
Figure 2 for AI and Memory Wall
Figure 3 for AI and Memory Wall
Figure 4 for AI and Memory Wall
Viaarxiv icon

ROUTERBENCH: A Benchmark for Multi-LLM Routing System

Add code
Bookmark button
Alert button
Mar 18, 2024
Qitian Jason Hu, Jacob Bieker, Xiuyu Li, Nan Jiang, Benjamin Keigwin, Gaurav Ranganath, Kurt Keutzer, Shriyash Kaustubh Upadhyay

Figure 1 for ROUTERBENCH: A Benchmark for Multi-LLM Routing System
Figure 2 for ROUTERBENCH: A Benchmark for Multi-LLM Routing System
Figure 3 for ROUTERBENCH: A Benchmark for Multi-LLM Routing System
Figure 4 for ROUTERBENCH: A Benchmark for Multi-LLM Routing System
Viaarxiv icon

Q-SLAM: Quadric Representations for Monocular SLAM

Add code
Bookmark button
Alert button
Mar 12, 2024
Chensheng Peng, Chenfeng Xu, Yue Wang, Mingyu Ding, Heng Yang, Masayoshi Tomizuka, Kurt Keutzer, Marco Pavone, Wei Zhan

Figure 1 for Q-SLAM: Quadric Representations for Monocular SLAM
Figure 2 for Q-SLAM: Quadric Representations for Monocular SLAM
Figure 3 for Q-SLAM: Quadric Representations for Monocular SLAM
Figure 4 for Q-SLAM: Quadric Representations for Monocular SLAM
Viaarxiv icon

LLM Inference Unveiled: Survey and Roofline Model Insights

Add code
Bookmark button
Alert button
Mar 11, 2024
Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen Dong, Zhe Zhou, Chenhao Xue, Bingzhe Wu, Zhikai Li, Qingyi Gu, Yong Jae Lee, Yan Yan, Beidi Chen, Guangyu Sun, Kurt Keutzer

Viaarxiv icon