Picture for Otmane Sakhi

Otmane Sakhi

Learning to Bid in Repeated Second-Price Auctions with Dynamic Values and Aggregated Feedback

Add code
May 27, 2026
Viaarxiv icon

Off-Policy Learning to Reason Works Because It Is More Pessimistic Than You Think

Add code
May 27, 2026
Viaarxiv icon

Self-Consistency via Marginal Sharpening

Add code
May 27, 2026
Viaarxiv icon

Data Valuation for LLM Fine-Tuning: Efficient Shapley Value Approximation via Language Model Arithmetic

Add code
Dec 12, 2025
Figure 1 for Data Valuation for LLM Fine-Tuning: Efficient Shapley Value Approximation via Language Model Arithmetic
Viaarxiv icon

Off-Policy Learning in Large Action Spaces: Optimization Matters More Than Estimation

Add code
Sep 03, 2025
Figure 1 for Off-Policy Learning in Large Action Spaces: Optimization Matters More Than Estimation
Figure 2 for Off-Policy Learning in Large Action Spaces: Optimization Matters More Than Estimation
Figure 3 for Off-Policy Learning in Large Action Spaces: Optimization Matters More Than Estimation
Viaarxiv icon

Non-Linear Counterfactual Aggregate Optimization

Add code
Sep 03, 2025
Viaarxiv icon

Logarithmic Smoothing for Adaptive PAC-Bayesian Off-Policy Learning

Add code
Jun 12, 2025
Viaarxiv icon

Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning

Add code
May 23, 2024
Figure 1 for Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Figure 2 for Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Figure 3 for Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Figure 4 for Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Viaarxiv icon

Fast Slate Policy Optimization: Going Beyond Plackett-Luce

Add code
Aug 03, 2023
Viaarxiv icon

PAC-Bayesian Offline Contextual Bandits With Guarantees

Add code
Oct 24, 2022
Figure 1 for PAC-Bayesian Offline Contextual Bandits With Guarantees
Figure 2 for PAC-Bayesian Offline Contextual Bandits With Guarantees
Figure 3 for PAC-Bayesian Offline Contextual Bandits With Guarantees
Figure 4 for PAC-Bayesian Offline Contextual Bandits With Guarantees
Viaarxiv icon