Picture for Branislav Kveton

Branislav Kveton

Off-Policy Evaluation from Logged Human Feedback

Add code
Jun 14, 2024
Viaarxiv icon

Cross-Validated Off-Policy Evaluation

Add code
May 27, 2024
Viaarxiv icon

Optimal Design for Human Feedback

Add code
Apr 22, 2024
Viaarxiv icon

Experimental Design for Active Transductive Inference in Large Language Models

Add code
Apr 12, 2024
Figure 1 for Experimental Design for Active Transductive Inference in Large Language Models
Figure 2 for Experimental Design for Active Transductive Inference in Large Language Models
Figure 3 for Experimental Design for Active Transductive Inference in Large Language Models
Figure 4 for Experimental Design for Active Transductive Inference in Large Language Models
Viaarxiv icon

MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

Add code
Jan 17, 2024
Figure 1 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Figure 2 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Figure 3 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Figure 4 for MADA: Meta-Adaptive Optimizers through hyper-gradient Descent
Viaarxiv icon

Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

Add code
Dec 22, 2023
Figure 1 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Figure 2 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Figure 3 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Figure 4 for Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs
Viaarxiv icon

Pre-trained Recommender Systems: A Causal Debiasing Perspective

Add code
Oct 30, 2023
Figure 1 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Figure 2 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Figure 3 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Figure 4 for Pre-trained Recommender Systems: A Causal Debiasing Perspective
Viaarxiv icon

Pessimistic Off-Policy Multi-Objective Optimization

Add code
Oct 28, 2023
Figure 1 for Pessimistic Off-Policy Multi-Objective Optimization
Figure 2 for Pessimistic Off-Policy Multi-Objective Optimization
Figure 3 for Pessimistic Off-Policy Multi-Objective Optimization
Figure 4 for Pessimistic Off-Policy Multi-Objective Optimization
Viaarxiv icon

Efficient and Interpretable Bandit Algorithms

Add code
Oct 23, 2023
Figure 1 for Efficient and Interpretable Bandit Algorithms
Figure 2 for Efficient and Interpretable Bandit Algorithms
Figure 3 for Efficient and Interpretable Bandit Algorithms
Figure 4 for Efficient and Interpretable Bandit Algorithms
Viaarxiv icon

Logarithmic Bayes Regret Bounds

Add code
Jun 15, 2023
Figure 1 for Logarithmic Bayes Regret Bounds
Figure 2 for Logarithmic Bayes Regret Bounds
Viaarxiv icon