Picture for Olivier Delalleau

Olivier Delalleau

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

HelpSteer2: Open-source dataset for training top-performing reward models

Add code
Jun 12, 2024
Viaarxiv icon

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Add code
May 02, 2024
Viaarxiv icon

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Add code
Nov 16, 2023
Figure 1 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 2 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 3 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Figure 4 for HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM
Viaarxiv icon

IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control

Add code
Jun 01, 2023
Figure 1 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Figure 2 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Figure 3 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Figure 4 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Viaarxiv icon

A Closer Look at Codistillation for Distributed Training

Add code
Oct 06, 2020
Figure 1 for A Closer Look at Codistillation for Distributed Training
Figure 2 for A Closer Look at Codistillation for Distributed Training
Figure 3 for A Closer Look at Codistillation for Distributed Training
Figure 4 for A Closer Look at Codistillation for Distributed Training
Viaarxiv icon

Discrete and Continuous Action Representation for Practical RL in Video Games

Add code
Dec 23, 2019
Figure 1 for Discrete and Continuous Action Representation for Practical RL in Video Games
Figure 2 for Discrete and Continuous Action Representation for Practical RL in Video Games
Figure 3 for Discrete and Continuous Action Representation for Practical RL in Video Games
Figure 4 for Discrete and Continuous Action Representation for Practical RL in Video Games
Viaarxiv icon

Efficient EM Training of Gaussian Mixtures with Missing Data

Add code
Jan 08, 2018
Figure 1 for Efficient EM Training of Gaussian Mixtures with Missing Data
Figure 2 for Efficient EM Training of Gaussian Mixtures with Missing Data
Figure 3 for Efficient EM Training of Gaussian Mixtures with Missing Data
Viaarxiv icon

Theano: A Python framework for fast computation of mathematical expressions

Add code
May 09, 2016
Figure 1 for Theano: A Python framework for fast computation of mathematical expressions
Figure 2 for Theano: A Python framework for fast computation of mathematical expressions
Figure 3 for Theano: A Python framework for fast computation of mathematical expressions
Figure 4 for Theano: A Python framework for fast computation of mathematical expressions
Viaarxiv icon