Picture for Yuchen Xiao

Yuchen Xiao

O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models

Add code
Oct 22, 2023
Figure 1 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Figure 2 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Figure 3 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Figure 4 for O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models
Viaarxiv icon

On-Robot Bayesian Reinforcement Learning for POMDPs

Add code
Jul 22, 2023
Figure 1 for On-Robot Bayesian Reinforcement Learning for POMDPs
Figure 2 for On-Robot Bayesian Reinforcement Learning for POMDPs
Figure 3 for On-Robot Bayesian Reinforcement Learning for POMDPs
Figure 4 for On-Robot Bayesian Reinforcement Learning for POMDPs
Viaarxiv icon

Sequential Fair Resource Allocation under a Markov Decision Process Framework

Add code
Jan 10, 2023
Figure 1 for Sequential Fair Resource Allocation under a Markov Decision Process Framework
Figure 2 for Sequential Fair Resource Allocation under a Markov Decision Process Framework
Figure 3 for Sequential Fair Resource Allocation under a Markov Decision Process Framework
Figure 4 for Sequential Fair Resource Allocation under a Markov Decision Process Framework
Viaarxiv icon

Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability

Add code
Oct 11, 2022
Figure 1 for Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability
Figure 2 for Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability
Figure 3 for Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability
Figure 4 for Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability
Viaarxiv icon

Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning

Add code
Oct 11, 2022
Figure 1 for Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Figure 2 for Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Figure 3 for Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Figure 4 for Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
Viaarxiv icon

A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning

Add code
Jan 03, 2022
Figure 1 for A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Figure 2 for A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Figure 3 for A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Figure 4 for A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning
Viaarxiv icon

Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning

Add code
Nov 02, 2021
Figure 1 for Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning
Figure 2 for Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning
Figure 3 for Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning
Figure 4 for Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning
Viaarxiv icon

Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning

Add code
Feb 08, 2021
Figure 1 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Figure 2 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Figure 3 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Figure 4 for Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
Viaarxiv icon

Adaptive directional Haar tight framelets on bounded domains for digraph signal representations

Add code
Aug 27, 2020
Figure 1 for Adaptive directional Haar tight framelets on bounded domains for digraph signal representations
Figure 2 for Adaptive directional Haar tight framelets on bounded domains for digraph signal representations
Figure 3 for Adaptive directional Haar tight framelets on bounded domains for digraph signal representations
Figure 4 for Adaptive directional Haar tight framelets on bounded domains for digraph signal representations
Viaarxiv icon

Macro-Action-Based Deep Multi-Agent Reinforcement Learning

Add code
Apr 18, 2020
Figure 1 for Macro-Action-Based Deep Multi-Agent Reinforcement Learning
Figure 2 for Macro-Action-Based Deep Multi-Agent Reinforcement Learning
Figure 3 for Macro-Action-Based Deep Multi-Agent Reinforcement Learning
Figure 4 for Macro-Action-Based Deep Multi-Agent Reinforcement Learning
Viaarxiv icon