Tadashi Kozuno

Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist

Feb 28, 2024
Via arXiv

A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees

Feb 02, 2024
Via arXiv

Multi-Agent Behavior Retrieval

Dec 04, 2023
Via arXiv

Local and adaptive mirror descents in extensive-form games

Sep 01, 2023
Via arXiv

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm

May 29, 2023
Via arXiv

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

May 22, 2023
Via arXiv

Counterfactual Fairness Filter for Fair-Delay Multi-Robot Navigation

May 19, 2023
Via arXiv

When to Replan? An Adaptive Replanning Strategy for Autonomous Navigation using Deep Reinforcement Learning

Apr 24, 2023
Via arXiv

Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints

Apr 18, 2023
Via arXiv

Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model

Feb 02, 2023
Via arXiv