Picture for Maxime Robeyns

Maxime Robeyns

A Self-Improving Coding Agent

Add code
Apr 21, 2025
Viaarxiv icon

Bayesian Reward Models for LLM Alignment

Add code
Feb 20, 2024
Figure 1 for Bayesian Reward Models for LLM Alignment
Figure 2 for Bayesian Reward Models for LLM Alignment
Figure 3 for Bayesian Reward Models for LLM Alignment
Viaarxiv icon

Bayesian low-rank adaptation for large language models

Add code
Aug 28, 2023
Figure 1 for Bayesian low-rank adaptation for large language models
Figure 2 for Bayesian low-rank adaptation for large language models
Figure 3 for Bayesian low-rank adaptation for large language models
Figure 4 for Bayesian low-rank adaptation for large language models
Viaarxiv icon

Taylor TD-learning

Add code
Feb 27, 2023
Figure 1 for Taylor TD-learning
Figure 2 for Taylor TD-learning
Figure 3 for Taylor TD-learning
Figure 4 for Taylor TD-learning
Viaarxiv icon