Picture for David Vazquez

David Vazquez

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Add code
Jul 08, 2024
Viaarxiv icon

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Add code
Jun 17, 2024
Figure 1 for RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Figure 2 for RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Figure 3 for RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Figure 4 for RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Viaarxiv icon

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Add code
Mar 12, 2024
Figure 1 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 2 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 3 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 4 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Viaarxiv icon

Capture the Flag: Uncovering Data Insights with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

StarVector: Generating Scalable Vector Graphics Code from Images

Add code
Dec 17, 2023
Viaarxiv icon

Group Robust Classification Without Any Group Information

Add code
Oct 28, 2023
Figure 1 for Group Robust Classification Without Any Group Information
Figure 2 for Group Robust Classification Without Any Group Information
Figure 3 for Group Robust Classification Without Any Group Information
Figure 4 for Group Robust Classification Without Any Group Information
Viaarxiv icon

OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning

Add code
Oct 28, 2023
Figure 1 for OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning
Figure 2 for OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning
Figure 3 for OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning
Figure 4 for OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning
Viaarxiv icon

Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection

Add code
Aug 22, 2023
Viaarxiv icon

FigGen: Text to Scientific Figure Generation

Add code
Jun 21, 2023
Figure 1 for FigGen: Text to Scientific Figure Generation
Figure 2 for FigGen: Text to Scientific Figure Generation
Figure 3 for FigGen: Text to Scientific Figure Generation
Figure 4 for FigGen: Text to Scientific Figure Generation
Viaarxiv icon

GEO-Bench: Toward Foundation Models for Earth Monitoring

Add code
Jun 06, 2023
Figure 1 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Figure 2 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Figure 3 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Figure 4 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Viaarxiv icon