Picture for Jan Peters

Jan Peters

An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients

Add code
Jul 20, 2021
Figure 1 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Figure 2 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Figure 3 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Figure 4 for An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
Viaarxiv icon

Efficient and Reactive Planning for High Speed Robot Air Hockey

Add code
Jul 14, 2021
Figure 1 for Efficient and Reactive Planning for High Speed Robot Air Hockey
Figure 2 for Efficient and Reactive Planning for High Speed Robot Air Hockey
Figure 3 for Efficient and Reactive Planning for High Speed Robot Air Hockey
Figure 4 for Efficient and Reactive Planning for High Speed Robot Air Hockey
Viaarxiv icon

High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning

Add code
Jun 16, 2021
Figure 1 for High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning
Figure 2 for High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning
Figure 3 for High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning
Figure 4 for High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning
Viaarxiv icon

Robust Value Iteration for Continuous Control Tasks

Add code
May 25, 2021
Figure 1 for Robust Value Iteration for Continuous Control Tasks
Figure 2 for Robust Value Iteration for Continuous Control Tasks
Figure 3 for Robust Value Iteration for Continuous Control Tasks
Figure 4 for Robust Value Iteration for Continuous Control Tasks
Viaarxiv icon

Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers

Add code
May 17, 2021
Figure 1 for Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers
Figure 2 for Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers
Figure 3 for Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers
Figure 4 for Evolutionary Training and Abstraction Yields Algorithmic Generalization of Neural Computers
Viaarxiv icon

Stochastic Control through Approximate Bayesian Input Inference

Add code
May 17, 2021
Figure 1 for Stochastic Control through Approximate Bayesian Input Inference
Figure 2 for Stochastic Control through Approximate Bayesian Input Inference
Figure 3 for Stochastic Control through Approximate Bayesian Input Inference
Figure 4 for Stochastic Control through Approximate Bayesian Input Inference
Viaarxiv icon

Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning

Add code
May 11, 2021
Figure 1 for Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Figure 2 for Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Figure 3 for Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Figure 4 for Composable Energy Policies for Reactive Motion Generation and Reinforcement Learning
Viaarxiv icon

Value Iteration in Continuous Actions, States and Time

Add code
May 10, 2021
Figure 1 for Value Iteration in Continuous Actions, States and Time
Figure 2 for Value Iteration in Continuous Actions, States and Time
Figure 3 for Value Iteration in Continuous Actions, States and Time
Figure 4 for Value Iteration in Continuous Actions, States and Time
Viaarxiv icon

Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation

Add code
May 05, 2021
Figure 1 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Figure 2 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Figure 3 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Figure 4 for Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
Viaarxiv icon

Reinforcement Learning using Guided Observability

Add code
Apr 22, 2021
Figure 1 for Reinforcement Learning using Guided Observability
Figure 2 for Reinforcement Learning using Guided Observability
Figure 3 for Reinforcement Learning using Guided Observability
Figure 4 for Reinforcement Learning using Guided Observability
Viaarxiv icon