Jacob Hilton

TruthfulQA: Measuring How Models Mimic Human Falsehoods

Sep 08, 2021
Stephanie Lin, Jacob Hilton, Owain Evans

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

Mar 29, 2021
Sharada Mohanty, Jyotish Poonganam, Adrien Gaidon, Andrey Kolobov, Blake Wulfe, Dipam Chakraborty, Gražvydas Šemetulskis, João Schapke, Jonas Kubilius, Jurgis Pašukonis, Linas Klimas, Matthew Hausknecht, Patrick MacAlpine, Quang Nhat Tran, Thomas Tumiel, Xiaocheng Tang, Xinwei Chen, Christopher Hesse, Jacob Hilton, William Hebgen Guss, Sahika Genc, John Schulman, Karl Cobbe

Phasic Policy Gradient

Sep 09, 2020
Karl Cobbe, Jacob Hilton, Oleg Klimov, John Schulman

Leveraging Procedural Generation to Benchmark Reinforcement Learning

Dec 03, 2019
Karl Cobbe, Christopher Hesse, Jacob Hilton, John Schulman
