Picture for Jon Saad-Falcon

Jon Saad-Falcon

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

Add code
Nov 14, 2025
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon

LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Add code
Dec 17, 2024
Viaarxiv icon

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Add code
Feb 14, 2024
Viaarxiv icon

ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems

Add code
Nov 16, 2023
Figure 1 for ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Figure 2 for ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Figure 3 for ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Figure 4 for ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Viaarxiv icon

PDFTriage: Question Answering over Long, Structured Documents

Add code
Sep 16, 2023
Figure 1 for PDFTriage: Question Answering over Long, Structured Documents
Figure 2 for PDFTriage: Question Answering over Long, Structured Documents
Figure 3 for PDFTriage: Question Answering over Long, Structured Documents
Figure 4 for PDFTriage: Question Answering over Long, Structured Documents
Viaarxiv icon

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

Add code
Mar 01, 2023
Figure 1 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 2 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 3 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Figure 4 for UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers
Viaarxiv icon

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

Add code
Dec 02, 2022
Figure 1 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Figure 2 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Figure 3 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Figure 4 for Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking
Viaarxiv icon

Embedding Recycling for Language Models

Add code
Jul 11, 2022
Figure 1 for Embedding Recycling for Language Models
Figure 2 for Embedding Recycling for Language Models
Figure 3 for Embedding Recycling for Language Models
Figure 4 for Embedding Recycling for Language Models
Viaarxiv icon