Huazheng Wang

Embodied LLM Agents Learn to Cooperate in Organized Teams

Mar 19, 2024
Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang

Via arXiv

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks

Mar 02, 2024
Yifan Zeng, Yiran Wu, Xiao Zhang, Huazheng Wang, Qingyun Wu

Via arXiv

Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits

Feb 21, 2024
Zhiwei Wang, Huazheng Wang, Hongning Wang

Via arXiv

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Jan 08, 2024
Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang

Via arXiv

Pure Exploration in Asynchronous Federated Bandits

Oct 17, 2023
Zichen Wang, Chuanhao Li, Chenyu Song, Lianghui Wang, Quanquan Gu, Huazheng Wang

Via arXiv

Adversarial Attacks on Combinatorial Multi-Armed Bandits

Oct 08, 2023
Rishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao

Via arXiv

Aligning Agent Policy with Externalities: Reward Design via Bilevel RL

Aug 03, 2023
Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Dinesh Manocha, Huazheng Wang, Furong Huang, Mengdi Wang

Via arXiv

Online Modeling and Monitoring of Dependent Processes under Resource Constraints

Jul 26, 2023
Tanapol Kosolwattana, Huazheng Wang, Ying Lin

Via arXiv

How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?

Jul 26, 2023
Huazheng Wang, Daixuan Cheng, Haifeng Sun, Jingyu Wang, Qi Qi, Jianxin Liao, Jing Wang, Cong Liu

Via arXiv

Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems

Jul 24, 2023
Xiang Ji, Huazheng Wang, Minshuo Chen, Tuo Zhao, Mengdi Wang

Via arXiv