Sanjiv Kumar

Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts

Apr 14, 2024
Taehyeon Kim, Ananda Theertha Suresh, Kishore Papineni, Michael Riley, Sanjiv Kumar, Adrian Benton

SOAR: Improved Indexing for Approximate Nearest Neighbor Search

Mar 31, 2024
Philip Sun, David Simcha, Dave Dopson, Ruiqi Guo, Sanjiv Kumar

Metric-aware LLM inference

Mar 07, 2024
Michal Lukasik, Harikrishna Narasimhan, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar

HiRE: High Recall Approximate Top-$k$ Estimation for Efficient LLM Inference

Feb 14, 2024
Yashas Samaga B L, Varun Yerram, Chong You, Srinadh Bhojanapalli, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

Tandem Transformers for Inference Efficient LLMs

Feb 13, 2024
Aishwarya P S, Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

Efficient Stagewise Pretraining via Progressive Subnetworks

Feb 08, 2024
Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank Reddi, Satyen Kale, Sanjiv Kumar

SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection

Jan 24, 2024
Ke Ye, Heinrich Jiang, Afshin Rostamizadeh, Ayan Chakrabarti, Giulia DeSalvo, Jean-François Kagy, Lazaros Karydas, Gui Citovsky, Sanjiv Kumar

A Weighted K-Center Algorithm for Data Subset Selection

Dec 17, 2023
Srikumar Ramalingam, Pranjal Awasthi, Sanjiv Kumar

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Dec 15, 2023
Renat Aksitov, Sobhan Miryoosefi, Zonglin Li, Daliang Li, Sheila Babayan, Kavya Kopparapu, Zachary Fisher, Ruiqi Guo, Sushant Prakash, Pranesh Srinivasan, Manzil Zaheer, Felix Yu, Sanjiv Kumar

It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models

Oct 13, 2023
Lin Chen, Michal Lukasik, Wittawat Jitkrittum, Chong You, Sanjiv Kumar
