Picture for Donghwan Lee

Donghwan Lee

Suppressing Overestimation in Q-Learning through Adversarial Behaviors

Add code
Oct 10, 2023
Figure 1 for Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Figure 2 for Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Figure 3 for Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Figure 4 for Suppressing Overestimation in Q-Learning through Adversarial Behaviors
Viaarxiv icon

A primal-dual perspective for distributed TD-learning

Add code
Oct 01, 2023
Viaarxiv icon

On the Local Quadratic Stability of T-S Fuzzy Systems in the Vicinity of the Origin

Add code
Sep 14, 2023
Viaarxiv icon

An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes

Add code
Aug 17, 2023
Figure 1 for An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes
Figure 2 for An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes
Figure 3 for An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes
Figure 4 for An O.D.E. Framework of Distributed TD-Learning for Networked Multi-Agent Markov Decision Processes
Viaarxiv icon

Temporal Difference Learning with Experience Replay

Add code
Jun 16, 2023
Figure 1 for Temporal Difference Learning with Experience Replay
Figure 2 for Temporal Difference Learning with Experience Replay
Figure 3 for Temporal Difference Learning with Experience Replay
Viaarxiv icon

Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach

Add code
Jun 12, 2023
Viaarxiv icon

Optimal Heterogeneous Collaborative Linear Regression and Contextual Bandits

Add code
Jun 09, 2023
Figure 1 for Optimal Heterogeneous Collaborative Linear Regression and Contextual Bandits
Figure 2 for Optimal Heterogeneous Collaborative Linear Regression and Contextual Bandits
Figure 3 for Optimal Heterogeneous Collaborative Linear Regression and Contextual Bandits
Figure 4 for Optimal Heterogeneous Collaborative Linear Regression and Contextual Bandits
Viaarxiv icon

TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering

Add code
Mar 27, 2023
Figure 1 for TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering
Figure 2 for TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering
Figure 3 for TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering
Figure 4 for TMO: Textured Mesh Acquisition of Objects with a Mobile Device by using Differentiable Rendering
Viaarxiv icon

Backstepping Temporal Difference Learning

Add code
Feb 28, 2023
Figure 1 for Backstepping Temporal Difference Learning
Figure 2 for Backstepping Temporal Difference Learning
Figure 3 for Backstepping Temporal Difference Learning
Figure 4 for Backstepping Temporal Difference Learning
Viaarxiv icon

Demystifying Disagreement-on-the-Line in High Dimensions

Add code
Jan 31, 2023
Figure 1 for Demystifying Disagreement-on-the-Line in High Dimensions
Figure 2 for Demystifying Disagreement-on-the-Line in High Dimensions
Figure 3 for Demystifying Disagreement-on-the-Line in High Dimensions
Figure 4 for Demystifying Disagreement-on-the-Line in High Dimensions
Viaarxiv icon