Alexis Jacq

C3PO: Learning to Achieve Arbitrary Goals via Massively Entropic Pretraining

Nov 07, 2022
Via arXiv

Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act

Mar 16, 2022
Via arXiv

Foolproof Cooperative Learning

Jun 24, 2019
Via arXiv

Cognitive Architecture for Mutual Modelling

Feb 22, 2016
Via arXiv