Alert button
Picture for Chengshuai Shi

Chengshuai Shi

Alert button

Best Arm Identification for Prompt Learning under a Limited Budget

Add code
Bookmark button
Alert button
Feb 20, 2024
Chengshuai Shi, Kun Yang, Jing Yang, Cong Shen

Viaarxiv icon

Harnessing the Power of Federated Learning in Federated Contextual Bandits

Add code
Bookmark button
Alert button
Dec 26, 2023
Chengshuai Shi, Ruida Zhou, Kun Yang, Cong Shen

Viaarxiv icon

Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources

Add code
Bookmark button
Alert button
Jun 14, 2023
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Figure 1 for Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources
Figure 2 for Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources
Figure 3 for Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources
Figure 4 for Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources
Viaarxiv icon

On High-dimensional and Low-rank Tensor Bandits

Add code
Bookmark button
Alert button
May 06, 2023
Chengshuai Shi, Cong Shen, Nicholas D. Sidiropoulos

Figure 1 for On High-dimensional and Low-rank Tensor Bandits
Figure 2 for On High-dimensional and Low-rank Tensor Bandits
Viaarxiv icon

Reward Teaching for Federated Multi-armed Bandits

Add code
Bookmark button
Alert button
May 03, 2023
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Figure 1 for Reward Teaching for Federated Multi-armed Bandits
Figure 2 for Reward Teaching for Federated Multi-armed Bandits
Viaarxiv icon

A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games

Add code
Bookmark button
Alert button
Oct 04, 2022
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang

Viaarxiv icon

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game

Add code
Bookmark button
Alert button
May 31, 2022
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Liwei Wang, Tong Zhang

Figure 1 for Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
Figure 2 for Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
Figure 3 for Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game
Viaarxiv icon

Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization

Add code
Bookmark button
Alert button
Oct 29, 2021
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

Figure 1 for Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization
Figure 2 for Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization
Figure 3 for Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization
Figure 4 for Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization
Viaarxiv icon

(Almost) Free Incentivized Exploration from Decentralized Learning Agents

Add code
Bookmark button
Alert button
Oct 27, 2021
Chengshuai Shi, Haifeng Xu, Wei Xiong, Cong Shen

Figure 1 for (Almost) Free Incentivized Exploration from Decentralized Learning Agents
Figure 2 for (Almost) Free Incentivized Exploration from Decentralized Learning Agents
Figure 3 for (Almost) Free Incentivized Exploration from Decentralized Learning Agents
Viaarxiv icon