Picture for Aurélien Bibaut

Aurélien Bibaut

Reward Transfer from Inverse Reinforcement Learning: A Coupled Minimax Approach

Add code
May 27, 2026
Viaarxiv icon

Semiparametric Efficient Bilevel Gradient Estimation

Add code
May 20, 2026
Viaarxiv icon

Instrumental Variable Analysis Without Structural Equations

Add code
Apr 27, 2026
Viaarxiv icon

Efficient Inference after Directionally Stable Adaptive Experiments

Add code
Feb 25, 2026
Viaarxiv icon

The Value of Personalized Recommendations: Evidence from Netflix

Add code
Nov 11, 2025
Viaarxiv icon

Nonparametric Instrumental Variable Inference with Many Weak Instruments

Add code
May 12, 2025
Viaarxiv icon

Automatic Double Reinforcement Learning in Semiparametric Markov Decision Processes with Applications to Long-Term Causal Inference

Add code
Jan 12, 2025
Figure 1 for Automatic Double Reinforcement Learning in Semiparametric Markov Decision Processes with Applications to Long-Term Causal Inference
Figure 2 for Automatic Double Reinforcement Learning in Semiparametric Markov Decision Processes with Applications to Long-Term Causal Inference
Figure 3 for Automatic Double Reinforcement Learning in Semiparametric Markov Decision Processes with Applications to Long-Term Causal Inference
Viaarxiv icon

Demistifying Inference after Adaptive Experiments

Add code
May 02, 2024
Viaarxiv icon

Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning

Add code
Jun 03, 2021
Figure 1 for Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
Figure 2 for Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
Figure 3 for Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
Viaarxiv icon

Post-Contextual-Bandit Inference

Add code
Jun 01, 2021
Figure 1 for Post-Contextual-Bandit Inference
Figure 2 for Post-Contextual-Bandit Inference
Figure 3 for Post-Contextual-Bandit Inference
Figure 4 for Post-Contextual-Bandit Inference
Viaarxiv icon