Alexandre Drouin

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

Jul 08, 2024

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

Jul 07, 2024

Evaluating Interventional Reasoning Capabilities of Large Language Models

Apr 08, 2024

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Mar 12, 2024

Capture the Flag: Uncovering Data Insights with Large Language Models

Dec 21, 2023

Lag-Llama: Towards Foundation Models for Time Series Forecasting

Oct 12, 2023

TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series

Oct 02, 2023

Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation

Jul 30, 2023

Causal Discovery with Language Models as Imperfect Experts

Jul 05, 2023

Invariant Causal Set Covering Machines

Jun 07, 2023