Alert button
Picture for Keshav Santhanam

Keshav Santhanam

Alert button

ALTO: An Efficient Network Orchestrator for Compound AI Systems

Add code
Bookmark button
Alert button
Mar 07, 2024
Keshav Santhanam, Deepti Raghavan, Muhammad Shahir Rahman, Thejas Venkatesh, Neha Kunjal, Pratiksha Thaker, Philip Levis, Matei Zaharia

Figure 1 for ALTO: An Efficient Network Orchestrator for Compound AI Systems
Figure 2 for ALTO: An Efficient Network Orchestrator for Compound AI Systems
Figure 3 for ALTO: An Efficient Network Orchestrator for Compound AI Systems
Figure 4 for ALTO: An Efficient Network Orchestrator for Compound AI Systems
Viaarxiv icon

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Add code
Bookmark button
Alert button
Oct 05, 2023
Omar Khattab, Arnav Singhvi, Paridhi Maheshwari, Zhiyuan Zhang, Keshav Santhanam, Sri Vardhamanan, Saiful Haq, Ashutosh Sharma, Thomas T. Joshi, Hanna Moazam, Heather Miller, Matei Zaharia, Christopher Potts

Viaarxiv icon

Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs

Add code
Bookmark button
Alert button
May 03, 2023
Deepak Narayanan, Keshav Santhanam, Peter Henderson, Rishi Bommasani, Tony Lee, Percy Liang

Figure 1 for Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Figure 2 for Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Figure 3 for Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Figure 4 for Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs
Viaarxiv icon

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

Add code
Bookmark button
Alert button
Mar 01, 2023
Jon Saad-Falcon, Omar Khattab, Keshav Santhanam, Radu Florian, Martin Franz, Salim Roukos, Avirup Sil, Md Arafat Sultan, Christopher Potts

Figure 1 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 2 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 3 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 4 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Viaarxiv icon

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

Add code
Bookmark button
Alert button
Dec 28, 2022
Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, Matei Zaharia

Figure 1 for Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Figure 2 for Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Figure 3 for Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Viaarxiv icon

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

Add code
Bookmark button
Alert button
Dec 02, 2022
Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts

Figure 1 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Figure 2 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Figure 3 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Figure 4 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Bookmark button
Alert button
Nov 16, 2022
Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda

Figure 1 for Holistic Evaluation of Language Models
Figure 2 for Holistic Evaluation of Language Models
Figure 3 for Holistic Evaluation of Language Models
Figure 4 for Holistic Evaluation of Language Models
Viaarxiv icon

PLAID: An Efficient Engine for Late Interaction Retrieval

Add code
Bookmark button
Alert button
May 19, 2022
Keshav Santhanam, Omar Khattab, Christopher Potts, Matei Zaharia

Figure 1 for PLAID: An Efficient Engine for Late Interaction Retrieval
Figure 2 for PLAID: An Efficient Engine for Late Interaction Retrieval
Figure 3 for PLAID: An Efficient Engine for Late Interaction Retrieval
Figure 4 for PLAID: An Efficient Engine for Late Interaction Retrieval
Viaarxiv icon

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

Add code
Bookmark button
Alert button
Dec 16, 2021
Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia

Figure 1 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Figure 2 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Figure 3 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Figure 4 for ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction
Viaarxiv icon