Picture for Alexandre Drouin

Alexandre Drouin

Evaluating Interventional Reasoning Capabilities of Large Language Models

Add code
Apr 08, 2024
Viaarxiv icon

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Add code
Mar 12, 2024
Figure 1 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 2 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 3 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Figure 4 for WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Viaarxiv icon

Capture the Flag: Uncovering Data Insights with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

Lag-Llama: Towards Foundation Models for Time Series Forecasting

Add code
Oct 12, 2023
Figure 1 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Figure 2 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Figure 3 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Figure 4 for Lag-Llama: Towards Foundation Models for Time Series Forecasting
Viaarxiv icon

TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series

Add code
Oct 02, 2023
Figure 1 for TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series
Figure 2 for TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series
Figure 3 for TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series
Figure 4 for TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series
Viaarxiv icon

Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation

Add code
Jul 30, 2023
Figure 1 for Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation
Figure 2 for Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation
Figure 3 for Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation
Figure 4 for Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation
Viaarxiv icon

Causal Discovery with Language Models as Imperfect Experts

Add code
Jul 05, 2023
Viaarxiv icon

Invariant Causal Set Covering Machines

Add code
Jun 07, 2023
Figure 1 for Invariant Causal Set Covering Machines
Figure 2 for Invariant Causal Set Covering Machines
Figure 3 for Invariant Causal Set Covering Machines
Figure 4 for Invariant Causal Set Covering Machines
Viaarxiv icon

GEO-Bench: Toward Foundation Models for Earth Monitoring

Add code
Jun 06, 2023
Figure 1 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Figure 2 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Figure 3 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Figure 4 for GEO-Bench: Toward Foundation Models for Earth Monitoring
Viaarxiv icon

Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts

Add code
Apr 19, 2023
Figure 1 for Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts
Figure 2 for Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts
Figure 3 for Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts
Figure 4 for Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts
Viaarxiv icon