Michael Cafarella

Abacus: A Cost-Based Optimizer for Semantic Operator Systems

May 20, 2025

Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation

May 20, 2025

Causal DAG Summarization (Full Version)

Apr 21, 2025

EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline

Apr 04, 2025

PalimpChat: Declarative and Interactive AI analytics

Feb 05, 2025

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

Jan 30, 2025

Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs

Oct 23, 2024

BEAVER: An Enterprise Benchmark for Text-to-SQL

Sep 03, 2024

MDCR: A Dataset for Multi-Document Conditional Reasoning

Jun 17, 2024

A Declarative System for Optimizing AI Workloads

May 23, 2024