Alert button
Picture for Olivier Delalleau

Olivier Delalleau

Alert button

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM

Add code
Bookmark button
Alert button
Nov 16, 2023
Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan Swope, Oleksii Kuchaiev

Viaarxiv icon

IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control

Add code
Bookmark button
Alert button
Jun 01, 2023
Rohan Chitnis, Yingchen Xu, Bobak Hashemi, Lucas Lehnert, Urun Dogan, Zheqing Zhu, Olivier Delalleau

Figure 1 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Figure 2 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Figure 3 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Figure 4 for IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Viaarxiv icon

A Closer Look at Codistillation for Distributed Training

Add code
Bookmark button
Alert button
Oct 06, 2020
Shagun Sodhani, Olivier Delalleau, Mahmoud Assran, Koustuv Sinha, Nicolas Ballas, Michael Rabbat

Figure 1 for A Closer Look at Codistillation for Distributed Training
Figure 2 for A Closer Look at Codistillation for Distributed Training
Figure 3 for A Closer Look at Codistillation for Distributed Training
Figure 4 for A Closer Look at Codistillation for Distributed Training
Viaarxiv icon

Discrete and Continuous Action Representation for Practical RL in Video Games

Add code
Bookmark button
Alert button
Dec 23, 2019
Olivier Delalleau, Maxim Peter, Eloi Alonso, Adrien Logut

Figure 1 for Discrete and Continuous Action Representation for Practical RL in Video Games
Figure 2 for Discrete and Continuous Action Representation for Practical RL in Video Games
Figure 3 for Discrete and Continuous Action Representation for Practical RL in Video Games
Figure 4 for Discrete and Continuous Action Representation for Practical RL in Video Games
Viaarxiv icon

Efficient EM Training of Gaussian Mixtures with Missing Data

Add code
Bookmark button
Alert button
Jan 08, 2018
Olivier Delalleau, Aaron Courville, Yoshua Bengio

Figure 1 for Efficient EM Training of Gaussian Mixtures with Missing Data
Figure 2 for Efficient EM Training of Gaussian Mixtures with Missing Data
Figure 3 for Efficient EM Training of Gaussian Mixtures with Missing Data
Viaarxiv icon

Theano: A Python framework for fast computation of mathematical expressions

Add code
Bookmark button
Alert button
May 09, 2016
The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Mélanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian Goodfellow, Matt Graham, Caglar Gulcehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrancois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert T. McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang

Figure 1 for Theano: A Python framework for fast computation of mathematical expressions
Figure 2 for Theano: A Python framework for fast computation of mathematical expressions
Figure 3 for Theano: A Python framework for fast computation of mathematical expressions
Figure 4 for Theano: A Python framework for fast computation of mathematical expressions
Viaarxiv icon