Alert button
Picture for Jiheng Zhang

Jiheng Zhang

Alert button

RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model

Add code
Bookmark button
Alert button
Mar 20, 2024
Junyi Fan, Yuxuan Han, Jialin Zeng, Jian-Feng Cai, Yang Wang, Yang Xiang, Jiheng Zhang

Figure 1 for RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model
Viaarxiv icon

Stochastic Graph Bandit Learning with Side-Observations

Add code
Bookmark button
Alert button
Aug 29, 2023
Xueping Gong, Jiheng Zhang

Viaarxiv icon

Provably Efficient Learning in Partially Observable Contextual Bandit

Add code
Bookmark button
Alert button
Aug 07, 2023
Xueping Gong, Jiheng Zhang

Figure 1 for Provably Efficient Learning in Partially Observable Contextual Bandit
Figure 2 for Provably Efficient Learning in Partially Observable Contextual Bandit
Figure 3 for Provably Efficient Learning in Partially Observable Contextual Bandit
Figure 4 for Provably Efficient Learning in Partially Observable Contextual Bandit
Viaarxiv icon

Debiasing Recommendation by Learning Identifiable Latent Confounders

Add code
Bookmark button
Alert button
Feb 10, 2023
Qing Zhang, Xiaoying Zhang, Yang Liu, Hongning Wang, Min Gao, Jiheng Zhang, Ruocheng Guo

Figure 1 for Debiasing Recommendation by Learning Identifiable Latent Confounders
Figure 2 for Debiasing Recommendation by Learning Identifiable Latent Confounders
Figure 3 for Debiasing Recommendation by Learning Identifiable Latent Confounders
Figure 4 for Debiasing Recommendation by Learning Identifiable Latent Confounders
Viaarxiv icon

Single-Trajectory Distributionally Robust Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 27, 2023
Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jiheng Zhang, Zhengyuan Zhou

Figure 1 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 2 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 3 for Single-Trajectory Distributionally Robust Reinforcement Learning
Figure 4 for Single-Trajectory Distributionally Robust Reinforcement Learning
Viaarxiv icon

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Add code
Bookmark button
Alert button
Sep 29, 2022
Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

Figure 1 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Figure 2 for Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Dual Instrumental Method for Confounded Kernelized Bandits

Add code
Bookmark button
Alert button
Sep 07, 2022
Xueping Gong, Jiheng Zhang

Figure 1 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 2 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 3 for Dual Instrumental Method for Confounded Kernelized Bandits
Figure 4 for Dual Instrumental Method for Confounded Kernelized Bandits
Viaarxiv icon

On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits

Add code
Bookmark button
Alert button
Jun 16, 2022
Yuxuan Han, Zhicong Liang, Zhipeng Liang, Yang Wang, Yuan Yao, Jiheng Zhang

Figure 1 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Figure 2 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Figure 3 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Figure 4 for On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits
Viaarxiv icon

Generalized Linear Bandits with Local Differential Privacy

Add code
Bookmark button
Alert button
Jun 07, 2021
Yuxuan Han, Zhipeng Liang, Yang Wang, Jiheng Zhang

Figure 1 for Generalized Linear Bandits with Local Differential Privacy
Figure 2 for Generalized Linear Bandits with Local Differential Privacy
Figure 3 for Generalized Linear Bandits with Local Differential Privacy
Viaarxiv icon