Alert button
Picture for Prateek Jain

Prateek Jain

Alert button

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Add code
Bookmark button
Alert button
Mar 29, 2024
Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim

Figure 1 for Gecko: Versatile Text Embeddings Distilled from Large Language Models
Figure 2 for Gecko: Versatile Text Embeddings Distilled from Large Language Models
Figure 3 for Gecko: Versatile Text Embeddings Distilled from Large Language Models
Figure 4 for Gecko: Versatile Text Embeddings Distilled from Large Language Models
Viaarxiv icon

HiRE: High Recall Approximate Top-$k$ Estimation for Efficient LLM Inference

Add code
Bookmark button
Alert button
Feb 14, 2024
Yashas Samaga B L, Varun Yerram, Chong You, Srinadh Bhojanapalli, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

Viaarxiv icon

Tandem Transformers for Inference Efficient LLMs

Add code
Bookmark button
Alert button
Feb 13, 2024
Aishwarya P S, Pranav Ajit Nair, Yashas Samaga, Toby Boyd, Sanjiv Kumar, Prateek Jain, Praneeth Netrapalli

Viaarxiv icon

LLM Augmented LLMs: Expanding Capabilities through Composition

Add code
Bookmark button
Alert button
Jan 04, 2024
Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar

Viaarxiv icon

Blocked Collaborative Bandits: Online Collaborative Filtering with Per-Item Budget Constraints

Add code
Bookmark button
Alert button
Oct 31, 2023
Soumyabrata Pal, Arun Sai Suggala, Karthikeyan Shanmugam, Prateek Jain

Viaarxiv icon

Efficacy of Dual-Encoders for Extreme Multi-Label Classification

Add code
Bookmark button
Alert button
Oct 16, 2023
Nilesh Gupta, Devvrit Khatri, Ankit S Rawat, Srinadh Bhojanapalli, Prateek Jain, Inderjit S Dhillon

Figure 1 for Efficacy of Dual-Encoders for Extreme Multi-Label Classification
Figure 2 for Efficacy of Dual-Encoders for Extreme Multi-Label Classification
Figure 3 for Efficacy of Dual-Encoders for Extreme Multi-Label Classification
Figure 4 for Efficacy of Dual-Encoders for Extreme Multi-Label Classification
Viaarxiv icon

EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval

Add code
Bookmark button
Alert button
Oct 13, 2023
Ramnath Kumar, Anshul Mittal, Nilesh Gupta, Aditya Kusupati, Inderjit Dhillon, Prateek Jain

Figure 1 for EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval
Figure 2 for EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval
Figure 3 for EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval
Figure 4 for EHI: End-to-end Learning of Hierarchical Index for Efficient Dense Retrieval
Viaarxiv icon

MatFormer: Nested Transformer for Elastic Inference

Add code
Bookmark button
Alert button
Oct 11, 2023
Devvrit, Sneha Kudugunta, Aditya Kusupati, Tim Dettmers, Kaifeng Chen, Inderjit Dhillon, Yulia Tsvetkov, Hannaneh Hajishirzi, Sham Kakade, Ali Farhadi, Prateek Jain

Figure 1 for MatFormer: Nested Transformer for Elastic Inference
Figure 2 for MatFormer: Nested Transformer for Elastic Inference
Figure 3 for MatFormer: Nested Transformer for Elastic Inference
Figure 4 for MatFormer: Nested Transformer for Elastic Inference
Viaarxiv icon

End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates

Add code
Bookmark button
Alert button
Jun 13, 2023
Anshul Nasery, Hardik Shah, Arun Sai Suggala, Prateek Jain

Figure 1 for End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates
Figure 2 for End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates
Figure 3 for End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates
Figure 4 for End-to-End Neural Network Compression via $\frac{\ell_1}{\ell_2}$ Regularized Latency Surrogates
Viaarxiv icon

AdANNS: A Framework for Adaptive Semantic Search

Add code
Bookmark button
Alert button
May 30, 2023
Aniket Rege, Aditya Kusupati, Sharan Ranjit S, Alan Fan, Qingqing Cao, Sham Kakade, Prateek Jain, Ali Farhadi

Figure 1 for AdANNS: A Framework for Adaptive Semantic Search
Figure 2 for AdANNS: A Framework for Adaptive Semantic Search
Figure 3 for AdANNS: A Framework for Adaptive Semantic Search
Figure 4 for AdANNS: A Framework for Adaptive Semantic Search
Viaarxiv icon