Picture for Vibhavari Dasagi

Vibhavari Dasagi

States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers

Add code
Feb 06, 2024
Figure 1 for States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
Figure 2 for States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
Figure 3 for States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
Figure 4 for States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
Viaarxiv icon

Human-Timescale Adaptation in an Open-Ended Task Space

Add code
Jan 18, 2023
Figure 1 for Human-Timescale Adaptation in an Open-Ended Task Space
Figure 2 for Human-Timescale Adaptation in an Open-Ended Task Space
Figure 3 for Human-Timescale Adaptation in an Open-Ended Task Space
Figure 4 for Human-Timescale Adaptation in an Open-Ended Task Space
Viaarxiv icon

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

Add code
Sep 22, 2022
Viaarxiv icon

The Challenges of Exploration for Offline Reinforcement Learning

Add code
Jan 27, 2022
Figure 1 for The Challenges of Exploration for Offline Reinforcement Learning
Figure 2 for The Challenges of Exploration for Offline Reinforcement Learning
Figure 3 for The Challenges of Exploration for Offline Reinforcement Learning
Figure 4 for The Challenges of Exploration for Offline Reinforcement Learning
Viaarxiv icon

Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots

Add code
Dec 10, 2021
Figure 1 for Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots
Figure 2 for Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots
Figure 3 for Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots
Figure 4 for Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots
Viaarxiv icon

Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration

Add code
Sep 17, 2021
Figure 1 for Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Figure 2 for Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Figure 3 for Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Figure 4 for Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Viaarxiv icon

Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics

Add code
Jul 22, 2021
Figure 1 for Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Figure 2 for Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Figure 3 for Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Figure 4 for Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics
Viaarxiv icon

Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience

Add code
Oct 07, 2020
Figure 1 for Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience
Figure 2 for Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience
Figure 3 for Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience
Figure 4 for Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience
Viaarxiv icon

Multiplicative Controller Fusion: A Hybrid Navigation Strategy For Deployment in Unknown Environments

Add code
Mar 13, 2020
Figure 1 for Multiplicative Controller Fusion: A Hybrid Navigation Strategy For Deployment in Unknown Environments
Figure 2 for Multiplicative Controller Fusion: A Hybrid Navigation Strategy For Deployment in Unknown Environments
Figure 3 for Multiplicative Controller Fusion: A Hybrid Navigation Strategy For Deployment in Unknown Environments
Figure 4 for Multiplicative Controller Fusion: A Hybrid Navigation Strategy For Deployment in Unknown Environments
Viaarxiv icon

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks

Add code
Nov 20, 2019
Figure 1 for Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Figure 2 for Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Figure 3 for Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Figure 4 for Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks
Viaarxiv icon