Picture for Shreya Shankar

Shreya Shankar

PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines

Add code
Apr 20, 2025
Viaarxiv icon

RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines

Add code
Apr 18, 2025
Viaarxiv icon

LLM-Powered Proactive Data Systems

Add code
Feb 18, 2025
Viaarxiv icon

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

Add code
Oct 16, 2024
Figure 1 for DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
Figure 2 for DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
Figure 3 for DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
Figure 4 for DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
Viaarxiv icon

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Add code
Apr 18, 2024
Figure 1 for Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
Figure 2 for Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
Figure 3 for Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
Figure 4 for Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
Viaarxiv icon

Revisiting Prompt Engineering via Declarative Crowdsourcing

Add code
Aug 07, 2023
Viaarxiv icon

Operationalizing Machine Learning: An Interview Study

Add code
Sep 16, 2022
Figure 1 for Operationalizing Machine Learning: An Interview Study
Figure 2 for Operationalizing Machine Learning: An Interview Study
Figure 3 for Operationalizing Machine Learning: An Interview Study
Figure 4 for Operationalizing Machine Learning: An Interview Study
Viaarxiv icon

Rethinking Streaming Machine Learning Evaluation

Add code
May 23, 2022
Figure 1 for Rethinking Streaming Machine Learning Evaluation
Figure 2 for Rethinking Streaming Machine Learning Evaluation
Figure 3 for Rethinking Streaming Machine Learning Evaluation
Viaarxiv icon

Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming

Add code
Nov 03, 2020
Figure 1 for Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming
Figure 2 for Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming
Figure 3 for Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming
Figure 4 for Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming
Viaarxiv icon

Optimal Transfer Learning Model for Binary Classification of Funduscopic Images through Simple Heuristics

Add code
Feb 20, 2020
Figure 1 for Optimal Transfer Learning Model for Binary Classification of Funduscopic Images through Simple Heuristics
Figure 2 for Optimal Transfer Learning Model for Binary Classification of Funduscopic Images through Simple Heuristics
Viaarxiv icon