Picture for Branislav Kveton

Branislav Kveton

Adobe Research

Off-Policy Evaluation from Logged Human Feedback

Add code
Jun 14, 2024
Figure 1 for Off-Policy Evaluation from Logged Human Feedback
Figure 2 for Off-Policy Evaluation from Logged Human Feedback
Figure 3 for Off-Policy Evaluation from Logged Human Feedback
Figure 4 for Off-Policy Evaluation from Logged Human Feedback
Viaarxiv icon

Cross-Validated Off-Policy Evaluation

Add code
May 27, 2024
Viaarxiv icon

Optimal Design for Human Feedback

Add code
Apr 22, 2024
Figure 1 for Optimal Design for Human Feedback
Figure 2 for Optimal Design for Human Feedback
Figure 3 for Optimal Design for Human Feedback
Viaarxiv icon

Experimental Design for Active Transductive Inference in Large Language Models

Add code
Apr 12, 2024
Figure 1 for Experimental Design for Active Transductive Inference in Large Language Models
Figure 2 for Experimental Design for Active Transductive Inference in Large Language Models
Figure 3 for Experimental Design for Active Transductive Inference in Large Language Models
Figure 4 for Experimental Design for Active Transductive Inference in Large Language Models
Viaarxiv icon

MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

Add code
Jan 17, 2024
Figure 1 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Figure 2 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Figure 3 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Figure 4 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Viaarxiv icon

Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

Add code
Dec 22, 2023
Figure 1 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Figure 2 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Figure 3 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Figure 4 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Viaarxiv icon

Pre-trained Recommender Systems: A Causal Debiasing Perspective

Add code
Oct 30, 2023
Figure 1 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Figure 2 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Figure 3 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Figure 4 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Viaarxiv icon

Pessimistic Off-Policy Multi-Objective Optimization

Add code
Oct 28, 2023
Figure 1 for Pessimistic Off-Policy Multi-Objective Optimization
Figure 2 for Pessimistic Off-Policy Multi-Objective Optimization
Figure 3 for Pessimistic Off-Policy Multi-Objective Optimization
Figure 4 for Pessimistic Off-Policy Multi-Objective Optimization
Viaarxiv icon

Efficient and Interpretable Bandit Algorithms

Add code
Oct 23, 2023
Figure 1 for Efficient and Interpretable Bandit Algorithms
Figure 2 for Efficient and Interpretable Bandit Algorithms
Figure 3 for Efficient and Interpretable Bandit Algorithms
Figure 4 for Efficient and Interpretable Bandit Algorithms
Viaarxiv icon

Logarithmic Bayes Regret Bounds

Add code
Jun 15, 2023
Figure 1 for Logarithmic Bayes Regret Bounds
Figure 2 for Logarithmic Bayes Regret Bounds
Figure 3 for Logarithmic Bayes Regret Bounds
Viaarxiv icon