Huazheng Wang

Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits

Feb 21, 2024
Zhiwei Wang, Huazheng Wang, Hongning Wang

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Jan 08, 2024
Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang

Pure Exploration in Asynchronous Federated Bandits

Oct 17, 2023
Zichen Wang, Chuanhao Li, Chenyu Song, Lianghui Wang, Quanquan Gu, Huazheng Wang

Adversarial Attacks on Combinatorial Multi-Armed Bandits

Oct 08, 2023
Rishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao

Aligning Agent Policy with Externalities: Reward Design via Bilevel RL

Aug 03, 2023
Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Dinesh Manocha, Huazheng Wang, Furong Huang, Mengdi Wang

Online Modeling and Monitoring of Dependent Processes under Resource Constraints

Jul 26, 2023
Tanapol Kosolwattana, Huazheng Wang, Ying Lin

How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?

Jul 26, 2023
Huazheng Wang, Daixuan Cheng, Haifeng Sun, Jingyu Wang, Qi Qi, Jianxin Liao, Jing Wang, Cong Liu

Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems

Jul 24, 2023
Xiang Ji, Huazheng Wang, Minshuo Chen, Tuo Zhao, Mengdi Wang

Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP

Jun 21, 2023
Jiacheng Guo, Zihao Li, Huazheng Wang, Mengdi Wang, Zhuoran Yang, Xuezhou Zhang

Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective

Jun 13, 2023
Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang
