Alert button
Picture for Roy Fox

Roy Fox

Alert button

Reinforcement Learning from Delayed Observations via World Models

Add code
Bookmark button
Alert button
Mar 18, 2024
Armin Karamzade, Kyungmin Kim, Montek Kalsi, Roy Fox

Figure 1 for Reinforcement Learning from Delayed Observations via World Models
Figure 2 for Reinforcement Learning from Delayed Observations via World Models
Figure 3 for Reinforcement Learning from Delayed Observations via World Models
Figure 4 for Reinforcement Learning from Delayed Observations via World Models
Viaarxiv icon

Moonwalk: Inverse-Forward Differentiation

Add code
Bookmark button
Alert button
Feb 22, 2024
Dmitrii Krylov, Armin Karamzade, Roy Fox

Viaarxiv icon

Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills

Add code
Bookmark button
Alert button
Feb 05, 2024
Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, Roy Fox

Viaarxiv icon

Learning to Design Analog Circuits to Meet Threshold Specifications

Add code
Bookmark button
Alert button
Jul 25, 2023
Dmitrii Krylov, Pooya Khajeh, Junhan Ouyang, Thomas Reeves, Tongkai Liu, Hiba Ajmal, Hamidreza Aghasi, Roy Fox

Figure 1 for Learning to Design Analog Circuits to Meet Threshold Specifications
Figure 2 for Learning to Design Analog Circuits to Meet Threshold Specifications
Figure 3 for Learning to Design Analog Circuits to Meet Threshold Specifications
Figure 4 for Learning to Design Analog Circuits to Meet Threshold Specifications
Viaarxiv icon

Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

Add code
Bookmark button
Alert button
Jul 21, 2023
Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

Figure 1 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Figure 2 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Figure 3 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Figure 4 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Viaarxiv icon

Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling

Add code
Bookmark button
Alert button
Jan 28, 2023
Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox

Figure 1 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Figure 2 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Figure 3 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Figure 4 for Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling
Viaarxiv icon

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Add code
Bookmark button
Alert button
Sep 16, 2022
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Figure 1 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 2 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 3 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 4 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Viaarxiv icon

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

Add code
Bookmark button
Alert button
Jul 19, 2022
JB Lanier, Stephen McAleer, Pierre Baldi, Roy Fox

Figure 1 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 2 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 3 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 4 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Viaarxiv icon

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

Add code
Bookmark button
Alert button
Jul 13, 2022
Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm

Figure 1 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 2 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 3 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 4 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Viaarxiv icon

Learning to Query Internet Text for Informing Reinforcement Learning Agents

Add code
Bookmark button
Alert button
May 25, 2022
Kolby Nottingham, Alekhya Pyla, Sameer Singh, Roy Fox

Figure 1 for Learning to Query Internet Text for Informing Reinforcement Learning Agents
Figure 2 for Learning to Query Internet Text for Informing Reinforcement Learning Agents
Figure 3 for Learning to Query Internet Text for Informing Reinforcement Learning Agents
Viaarxiv icon