Mengdi Wang

An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization

Apr 11, 2024
Minshuo Chen, Song Mei, Jianqing Fan, Mengdi Wang

Diffusion Model for Data-Driven Black-Box Optimization

Mar 20, 2024
Zihao Li, Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Yinyu Ye, Minshuo Chen, Mengdi Wang

Embodied LLM Agents Learn to Cooperate in Organized Teams

Mar 19, 2024
Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang

Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

Mar 18, 2024
Hengyu Fu, Zhuoran Yang, Mengdi Wang, Minshuo Chen

Offline Multitask Representation Learning for Reinforcement Learning

Mar 18, 2024
Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup

Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework

Mar 17, 2024
Kaiyan Chang, Kun Wang, Nan Yang, Ying Wang, Dantong Jin, Wenlong Zhu, Zhirong Chen, Cangyuan Li, Hao Yan, Yunhao Zhou, Zhuoliang Zhao, Yuan Cheng, Yudong Pan, Yiqi Liu, Mengdi Wang, Shengwen Liang, Yinhe Han, Huawei Li, Xiaowei Li

Regularized DeepIV with Model Selection

Mar 07, 2024
Zihao Li, Hui Lan, Vasilis Syrgkanis, Mengdi Wang, Masatoshi Uehara

Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models

Mar 03, 2024
Yuchen Wu, Minshuo Chen, Zihao Li, Mengdi Wang, Yuting Wei

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Feb 16, 2024
Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Feb 14, 2024
Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang
