Alert button
Picture for Maxime Robeyns

Maxime Robeyns

Alert button

Bayesian Reward Models for LLM Alignment

Add code
Bookmark button
Alert button
Feb 20, 2024
Adam X. Yang, Maxime Robeyns, Thomas Coste, Jun Wang, Haitham Bou-Ammar, Laurence Aitchison

Viaarxiv icon

Bayesian low-rank adaptation for large language models

Add code
Bookmark button
Alert button
Aug 28, 2023
Adam X. Yang, Maxime Robeyns, Xi Wang, Laurence Aitchison

Viaarxiv icon

Taylor TD-learning

Add code
Bookmark button
Alert button
Feb 27, 2023
Michele Garibbo, Maxime Robeyns, Laurence Aitchison

Figure 1 for Taylor TD-learning
Figure 2 for Taylor TD-learning
Figure 3 for Taylor TD-learning
Figure 4 for Taylor TD-learning
Viaarxiv icon