Picture for Omar Khattab

Omar Khattab

Vector Policy Optimization: Training for Diversity Improves Test-Time Search

Add code
May 21, 2026
Viaarxiv icon

PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents

Add code
May 19, 2026
Viaarxiv icon

optimize_anything: A Universal API for Optimizing any Text Parameter

Add code
May 19, 2026
Viaarxiv icon

OBLIQ-Bench: Exposing Overlooked Bottlenecks in Modern Retrievers with Latent and Implicit Queries

Add code
May 07, 2026
Viaarxiv icon

Meta-Harness: End-to-End Optimization of Model Harnesses

Add code
Mar 30, 2026
Viaarxiv icon

Recursive Language Models

Add code
Dec 31, 2025
Viaarxiv icon

Reasoning-Intensive Regression

Add code
Aug 29, 2025
Figure 1 for Reasoning-Intensive Regression
Figure 2 for Reasoning-Intensive Regression
Figure 3 for Reasoning-Intensive Regression
Figure 4 for Reasoning-Intensive Regression
Viaarxiv icon

Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

Add code
Aug 06, 2025
Viaarxiv icon

ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring

Add code
Apr 21, 2025
Viaarxiv icon

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

Add code
Apr 17, 2025
Figure 1 for FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
Figure 2 for FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
Figure 3 for FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
Figure 4 for FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents
Viaarxiv icon