Picture for Michael Bloesch

Michael Bloesch

Value from Observations: Towards Large-Scale Imitation Learning via Self-Improvement

Add code
Jul 09, 2025
Viaarxiv icon

Preference Optimization as Probabilistic Inference

Add code
Oct 05, 2024
Figure 1 for Preference Optimization as Probabilistic Inference
Figure 2 for Preference Optimization as Probabilistic Inference
Figure 3 for Preference Optimization as Probabilistic Inference
Figure 4 for Preference Optimization as Probabilistic Inference
Viaarxiv icon

Imitating Language via Scalable Inverse Reinforcement Learning

Add code
Sep 02, 2024
Figure 1 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 2 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 3 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 4 for Imitating Language via Scalable Inverse Reinforcement Learning
Viaarxiv icon

Offline Actor-Critic Reinforcement Learning Scales to Large Models

Add code
Feb 08, 2024
Figure 1 for Offline Actor-Critic Reinforcement Learning Scales to Large Models
Figure 2 for Offline Actor-Critic Reinforcement Learning Scales to Large Models
Figure 3 for Offline Actor-Critic Reinforcement Learning Scales to Large Models
Figure 4 for Offline Actor-Critic Reinforcement Learning Scales to Large Models
Viaarxiv icon

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

Add code
Dec 18, 2023
Figure 1 for Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Figure 2 for Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Figure 3 for Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Figure 4 for Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Viaarxiv icon

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Add code
Apr 26, 2023
Viaarxiv icon

Dense RGB-D-Inertial SLAM with Map Deformations

Add code
Jul 22, 2022
Figure 1 for Dense RGB-D-Inertial SLAM with Map Deformations
Figure 2 for Dense RGB-D-Inertial SLAM with Map Deformations
Figure 3 for Dense RGB-D-Inertial SLAM with Map Deformations
Figure 4 for Dense RGB-D-Inertial SLAM with Map Deformations
Viaarxiv icon

The Challenges of Exploration for Offline Reinforcement Learning

Add code
Jan 27, 2022
Viaarxiv icon

Rethinking Exploration for Sample-Efficient Policy Learning

Add code
Jan 23, 2021
Figure 1 for Rethinking Exploration for Sample-Efficient Policy Learning
Figure 2 for Rethinking Exploration for Sample-Efficient Policy Learning
Figure 3 for Rethinking Exploration for Sample-Efficient Policy Learning
Figure 4 for Rethinking Exploration for Sample-Efficient Policy Learning
Viaarxiv icon

Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion

Add code
Aug 06, 2020
Figure 1 for Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion
Figure 2 for Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion
Figure 3 for Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion
Figure 4 for Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion
Viaarxiv icon