Alert button
Picture for Ksenia Konyushkova

Ksenia Konyushkova

Alert button

Reinforced Self-Training (ReST) for Language Modeling

Aug 21, 2023
Caglar Gulcehre, Tom Le Paine, Srivatsan Srinivasan, Ksenia Konyushkova, Lotte Weerts, Abhishek Sharma, Aditya Siddhant, Alex Ahern, Miaosen Wang, Chenjie Gu, Wolfgang Macherey, Arnaud Doucet, Orhan Firat, Nando de Freitas

Figure 1 for Reinforced Self-Training (ReST) for Language Modeling
Figure 2 for Reinforced Self-Training (ReST) for Language Modeling
Figure 3 for Reinforced Self-Training (ReST) for Language Modeling
Figure 4 for Reinforced Self-Training (ReST) for Language Modeling
Viaarxiv icon

$\pi2\text{vec}$: Policy Representations with Successor Features

Jun 16, 2023
Gianluca Scarpellini, Ksenia Konyushkova, Claudio Fantacci, Tom Le Paine, Yutian Chen, Misha Denil

Figure 1 for $\pi2\text{vec}$: Policy Representations with Successor Features
Figure 2 for $\pi2\text{vec}$: Policy Representations with Successor Features
Figure 3 for $\pi2\text{vec}$: Policy Representations with Successor Features
Figure 4 for $\pi2\text{vec}$: Policy Representations with Successor Features
Viaarxiv icon

Vision-Language Models as Success Detectors

Mar 13, 2023
Yuqing Du, Ksenia Konyushkova, Misha Denil, Akhil Raju, Jessica Landon, Felix Hill, Nando de Freitas, Serkan Cabi

Figure 1 for Vision-Language Models as Success Detectors
Figure 2 for Vision-Language Models as Success Detectors
Figure 3 for Vision-Language Models as Success Detectors
Figure 4 for Vision-Language Models as Success Detectors
Viaarxiv icon

Retrieval-Augmented Reinforcement Learning

Mar 09, 2022
Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Michal Valko, Simon Osindero, Timothy Lillicrap, Nicolas Heess, Charles Blundell

Figure 1 for Retrieval-Augmented Reinforcement Learning
Figure 2 for Retrieval-Augmented Reinforcement Learning
Figure 3 for Retrieval-Augmented Reinforcement Learning
Figure 4 for Retrieval-Augmented Reinforcement Learning
Viaarxiv icon

Active Offline Policy Selection

Jun 18, 2021
Ksenia Konyushkova, Yutian Chen, Thomas Paine, Caglar Gulcehre, Cosmin Paduraru, Daniel J Mankowitz, Misha Denil, Nando de Freitas

Figure 1 for Active Offline Policy Selection
Figure 2 for Active Offline Policy Selection
Figure 3 for Active Offline Policy Selection
Figure 4 for Active Offline Policy Selection
Viaarxiv icon

Semi-supervised reward learning for offline reinforcement learning

Dec 12, 2020
Ksenia Konyushkova, Konrad Zolna, Yusuf Aytar, Alexander Novikov, Scott Reed, Serkan Cabi, Nando de Freitas

Figure 1 for Semi-supervised reward learning for offline reinforcement learning
Figure 2 for Semi-supervised reward learning for offline reinforcement learning
Figure 3 for Semi-supervised reward learning for offline reinforcement learning
Figure 4 for Semi-supervised reward learning for offline reinforcement learning
Viaarxiv icon

Offline Learning from Demonstrations and Unlabeled Experience

Nov 27, 2020
Konrad Zolna, Alexander Novikov, Ksenia Konyushkova, Caglar Gulcehre, Ziyu Wang, Yusuf Aytar, Misha Denil, Nando de Freitas, Scott Reed

Figure 1 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 2 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 3 for Offline Learning from Demonstrations and Unlabeled Experience
Figure 4 for Offline Learning from Demonstrations and Unlabeled Experience
Viaarxiv icon

A Framework for Data-Driven Robotics

Sep 26, 2019
Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Żołna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

Figure 1 for A Framework for Data-Driven Robotics
Figure 2 for A Framework for Data-Driven Robotics
Figure 3 for A Framework for Data-Driven Robotics
Figure 4 for A Framework for Data-Driven Robotics
Viaarxiv icon