Alert button
Picture for JB Lanier

JB Lanier

Alert button

Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

Add code
Bookmark button
Alert button
Jul 21, 2023
Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

Figure 1 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Figure 2 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Figure 3 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Figure 4 for Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Viaarxiv icon

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

Add code
Bookmark button
Alert button
Jul 19, 2022
JB Lanier, Stephen McAleer, Pierre Baldi, Roy Fox

Figure 1 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 2 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 3 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 4 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Viaarxiv icon

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

Add code
Bookmark button
Alert button
Jul 13, 2022
Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm

Figure 1 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 2 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 3 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 4 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Viaarxiv icon