Alert button
Picture for Nando de Freitas

Nando de Freitas

Alert button

Active Offline Policy Selection

Add code
Bookmark button
Alert button
Jun 18, 2021
Ksenia Konyushkova, Yutian Chen, Thomas Paine, Caglar Gulcehre, Cosmin Paduraru, Daniel J Mankowitz, Misha Denil, Nando de Freitas

Figure 1 for Active Offline Policy Selection
Figure 2 for Active Offline Policy Selection
Figure 3 for Active Offline Policy Selection
Figure 4 for Active Offline Policy Selection
Viaarxiv icon

On Instrumental Variable Regression for Deep Offline Policy Evaluation

Add code
Bookmark button
Alert button
May 21, 2021
Yutian Chen, Liyuan Xu, Caglar Gulcehre, Tom Le Paine, Arthur Gretton, Nando de Freitas, Arnaud Doucet

Figure 1 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 2 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 3 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Figure 4 for On Instrumental Variable Regression for Deep Offline Policy Evaluation
Viaarxiv icon

Regularized Behavior Value Estimation

Add code
Bookmark button
Alert button
Mar 17, 2021
Caglar Gulcehre, Sergio Gómez Colmenarejo, Ziyu Wang, Jakub Sygnowski, Thomas Paine, Konrad Zolna, Yutian Chen, Matthew Hoffman, Razvan Pascanu, Nando de Freitas

Figure 1 for Regularized Behavior Value Estimation
Figure 2 for Regularized Behavior Value Estimation
Figure 3 for Regularized Behavior Value Estimation
Figure 4 for Regularized Behavior Value Estimation
Viaarxiv icon

Semi-supervised reward learning for offline reinforcement learning

Add code
Bookmark button
Alert button
Dec 12, 2020
Ksenia Konyushkova, Konrad Zolna, Yusuf Aytar, Alexander Novikov, Scott Reed, Serkan Cabi, Nando de Freitas

Figure 1 for Semi-supervised reward learning for offline reinforcement learning
Figure 2 for Semi-supervised reward learning for offline reinforcement learning
Figure 3 for Semi-supervised reward learning for offline reinforcement learning
Figure 4 for Semi-supervised reward learning for offline reinforcement learning
Viaarxiv icon

Offline Learning from Demonstrations and Unlabeled Experience

Add code
Bookmark button
Alert button
Nov 27, 2020
Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Caglar Gulcehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott Reed

Figure 1 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 2 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 3 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 4 for Offline Learning from Demonstrations and Unlabeled Experience
Viaarxiv icon

Large-scale multilingual audio visual dubbing

Add code
Bookmark button
Alert button
Nov 06, 2020
Yi Yang, Brendan Shillingford, Yannis Assael, Miaosen Wang, Wendi Liu, Yutian Chen, Yu Zhang, Eren Sezener, Luis C. Cobo, Misha Denil, Yusuf Aytar, Nando de Freitas

Figure 1 for Large-scale multilingual audio visual dubbing
Figure 2 for Large-scale multilingual audio visual dubbing
Figure 3 for Large-scale multilingual audio visual dubbing
Figure 4 for Large-scale multilingual audio visual dubbing
Viaarxiv icon

Learning Deep Features in Instrumental Variable Regression

Add code
Bookmark button
Alert button
Nov 01, 2020
Liyuan Xu, Yutian Chen, Siddarth Srinivasan, Nando de Freitas, Arnaud Doucet, Arthur Gretton

Figure 1 for Learning Deep Features in Instrumental Variable Regression
Figure 2 for Learning Deep Features in Instrumental Variable Regression
Figure 3 for Learning Deep Features in Instrumental Variable Regression
Figure 4 for Learning Deep Features in Instrumental Variable Regression
Viaarxiv icon

Learning Compositional Neural Programs for Continuous Control

Add code
Bookmark button
Alert button
Jul 27, 2020
Thomas Pierrot, Nicolas Perrin, Feryal Behbahani, Alexandre Laterre, Olivier Sigaud, Karim Beguir, Nando de Freitas

Figure 1 for Learning Compositional Neural Programs for Continuous Control
Figure 2 for Learning Compositional Neural Programs for Continuous Control
Figure 3 for Learning Compositional Neural Programs for Continuous Control
Figure 4 for Learning Compositional Neural Programs for Continuous Control
Viaarxiv icon