
Chao Tian

Provable Policy Gradient Methods for Average-Reward Markov Potential Games

Mar 09, 2024
Via arXiv

Federated Linear Bandits with Finite Adversarial Actions

Nov 02, 2023
Via arXiv

Cross-Modality Proposal-guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection

Aug 23, 2023
Via arXiv

Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation

Jul 17, 2023
Via arXiv

Exactly Tight Information-Theoretic Generalization Error Bound for the Quadratic Gaussian Problem

May 01, 2023
Via arXiv

Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning

Jun 10, 2022
Via arXiv

Approximate Top-$m$ Arm Identification with Heterogeneous Reward Variances

Apr 11, 2022
Via arXiv

On Top-$k$ Selection from $m$-wise Partial Rankings via Borda Counting

Apr 11, 2022
Via arXiv

Fast Global Convergence of Policy Optimization for Constrained MDPs

Oct 31, 2021
Via arXiv

A Fast PC Algorithm with Reversed-order Pruning and A Parallelization Strategy

Sep 10, 2021
Via arXiv