Alert button
Picture for Mengdi Wang

Mengdi Wang

Alert button

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Feb 16, 2024
Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

Viaarxiv icon

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Feb 14, 2024
Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang

Viaarxiv icon

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Feb 07, 2024
Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

Viaarxiv icon

Embedding Large Language Models into Extended Reality: Opportunities and Challenges for Inclusion, Engagement, and Privacy

Feb 06, 2024
Efe Bozkir, Süleyman Özdel, Ka Hei Carrie Lau, Mengdi Wang, Hong Gao, Enkelejda Kasneci

Viaarxiv icon

TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients

Jan 29, 2024
Mengdi Wang, Anna Bodonhelyi, Efe Bozkir, Enkelejda Kasneci

Viaarxiv icon

Scalable Normalizing Flows Enable Boltzmann Generators for Macromolecules

Jan 08, 2024
Joseph C. Kim, David Bloore, Karan Kapoor, Jun Feng, Ming-Hong Hao, Mengdi Wang

Viaarxiv icon

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Jan 08, 2024
Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang

Viaarxiv icon

Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning?

Nov 29, 2023
Lei Zhao, Mengdi Wang, Yu Bai

Viaarxiv icon

Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation

Nov 04, 2023
Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma

Figure 1 for Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Figure 2 for Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Figure 3 for Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Figure 4 for Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Viaarxiv icon