Picture for David Vazquez

David Vazquez

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Add code
Jul 08, 2024
Figure 1 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 2 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 3 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Figure 4 for InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation
Viaarxiv icon

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Add code
Jun 17, 2024
Viaarxiv icon

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Add code
Mar 12, 2024
Viaarxiv icon

Capture the Flag: Uncovering Data Insights with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

StarVector: Generating Scalable Vector Graphics Code from Images

Add code
Dec 17, 2023
Viaarxiv icon

OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning

Add code
Oct 28, 2023
Viaarxiv icon

Group Robust Classification Without Any Group Information

Add code
Oct 28, 2023
Viaarxiv icon

Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection

Add code
Aug 22, 2023
Viaarxiv icon

FigGen: Text to Scientific Figure Generation

Add code
Jun 21, 2023
Viaarxiv icon

GEO-Bench: Toward Foundation Models for Earth Monitoring

Add code
Jun 06, 2023
Viaarxiv icon