Mengdi Wang

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Feb 07, 2024
Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

Embedding Large Language Models into Extended Reality: Opportunities and Challenges for Inclusion, Engagement, and Privacy

Feb 06, 2024
Efe Bozkir, Süleyman Özdel, Ka Hei Carrie Lau, Mengdi Wang, Hong Gao, Enkelejda Kasneci

TurboSVM-FL: Boosting Federated Learning through SVM Aggregation for Lazy Clients

Jan 29, 2024
Mengdi Wang, Anna Bodonhelyi, Efe Bozkir, Enkelejda Kasneci

Scalable Normalizing Flows Enable Boltzmann Generators for Macromolecules

Jan 08, 2024
Joseph C. Kim, David Bloore, Karan Kapoor, Jun Feng, Ming-Hong Hao, Mengdi Wang

Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

Jan 08, 2024
Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang

Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning?

Nov 29, 2023
Lei Zhao, Mengdi Wang, Yu Bai

Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation

Nov 04, 2023
Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma

Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks

Oct 16, 2023
Zihao Li, Xiang Ji, Minshuo Chen, Mengdi Wang

Federated Multi-Level Optimization over Decentralized Networks

Oct 10, 2023
Shuoguang Yang, Xuezhou Zhang, Mengdi Wang
