Matthieu Geist

INRIA Lorraine - LORIA

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Learning Discrete-Time Major-Minor Mean Field Games

Dec 17, 2023
Via arXiv

Nash Learning from Human Feedback

Dec 06, 2023
Via arXiv

A Survey of Temporal Credit Assignment in Deep Reinforcement Learning

Dec 02, 2023
Via arXiv

DRIFT: Deep Reinforcement Learning for Intelligent Floating Platforms Trajectories

Oct 06, 2023
Via arXiv

Offline Reinforcement Learning with On-Policy Q-Function Regularization

Jul 25, 2023
Via arXiv

A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning

Jul 24, 2023
Via arXiv

On Imitation in Mean-field Games

Jun 26, 2023
Via arXiv

GKD: Generalized Knowledge Distillation for Auto-regressive Sequence Models

Jun 23, 2023
Via arXiv

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

May 31, 2023
Via arXiv