Alert button
Picture for Nan Du

Nan Du

Alert button

Self-playing Adversarial Language Game Enhances LLM Reasoning

Add code
Bookmark button
Alert button
Apr 16, 2024
Pengyu Cheng, Tianhao Hu, Han Xu, Zhisong Zhang, Yong Dai, Lei Han, Nan Du

Viaarxiv icon

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Add code
Bookmark button
Alert button
Mar 22, 2024
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Guoli Yin, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Figure 1 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 2 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 3 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 4 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Viaarxiv icon

Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

Add code
Bookmark button
Alert button
Feb 28, 2024
Anchun Gui, Jian Li, Yong Dai, Nan Du, Han Xiao

Viaarxiv icon

Are Large Language Models Good Prompt Optimizers?

Add code
Bookmark button
Alert button
Feb 03, 2024
Ruotian Ma, Xiaolei Wang, Xin Zhou, Jian Li, Nan Du, Tao Gui, Qi Zhang, Xuanjing Huang

Viaarxiv icon

On Diversified Preferences of Large Language Model Alignment

Add code
Bookmark button
Alert button
Dec 25, 2023
Dun Zeng, Yong Dai, Pengyu Cheng, Tianhao Hu, Wanshun Chen, Nan Du, Zenglin Xu

Viaarxiv icon

Learning to Skip for Language Modeling

Add code
Bookmark button
Alert button
Nov 26, 2023
Dewen Zeng, Nan Du, Tao Wang, Yuanzhong Xu, Tao Lei, Zhifeng Chen, Claire Cui

Viaarxiv icon

Adversarial Preference Optimization

Add code
Bookmark button
Alert button
Nov 14, 2023
Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Nan Du

Figure 1 for Adversarial Preference Optimization
Figure 2 for Adversarial Preference Optimization
Figure 3 for Adversarial Preference Optimization
Figure 4 for Adversarial Preference Optimization
Viaarxiv icon

Everyone Deserves A Reward: Learning Customized Human Preferences

Add code
Bookmark button
Alert button
Sep 15, 2023
Pengyu Cheng, Jiawen Xie, Ke Bai, Yong Dai, Nan Du

Viaarxiv icon