Taisuke Kobayashi

CubeDAgger: Improved Robustness of Interactive Imitation Learning without Violation of Dynamic Stability

May 08, 2025
Via arXiv

Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity

Apr 29, 2025
Via arXiv

Weber-Fechner Law in Temporal Difference learning derived from Control as Inference

Dec 30, 2024
Via arXiv

Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency

Dec 17, 2024
Via arXiv

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning

Oct 22, 2024
Via arXiv

Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning

Oct 07, 2024
Via arXiv

LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World

Sep 29, 2024
Via arXiv

Revisiting Experience Replayable Conditions

Feb 15, 2024
Via arXiv

Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward

Aug 24, 2023
Via arXiv

Soft Actor-Critic Algorithm with Truly Inequality Constraint

Mar 08, 2023
Via arXiv