Alert button
Picture for Yuanzhi Li

Yuanzhi Li

Alert button

Specifying and Solving Robust Empirical Risk Minimization Problems Using CVXPY

Add code
Bookmark button
Alert button
Jun 09, 2023
Eric Luxenberg, Dhruv Malik, Yuanzhi Li, Aarti Singh, Stephen Boyd

Figure 1 for Specifying and Solving Robust Empirical Risk Minimization Problems Using CVXPY
Figure 2 for Specifying and Solving Robust Empirical Risk Minimization Problems Using CVXPY
Viaarxiv icon

Why Clean Generalization and Robust Overfitting Both Happen in Adversarial Training

Add code
Bookmark button
Alert button
Jun 02, 2023
Binghui Li, Yuanzhi Li

Figure 1 for Why Clean Generalization and Robust Overfitting Both Happen in Adversarial Training
Figure 2 for Why Clean Generalization and Robust Overfitting Both Happen in Adversarial Training
Figure 3 for Why Clean Generalization and Robust Overfitting Both Happen in Adversarial Training
Viaarxiv icon

Toward Understanding Why Adam Converges Faster Than SGD for Transformers

Add code
Bookmark button
Alert button
May 31, 2023
Yan Pan, Yuanzhi Li

Figure 1 for Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Figure 2 for Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Figure 3 for Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Figure 4 for Toward Understanding Why Adam Converges Faster Than SGD for Transformers
Viaarxiv icon

TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

Add code
Bookmark button
Alert button
May 24, 2023
Ronen Eldan, Yuanzhi Li

Figure 1 for TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Figure 2 for TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Figure 3 for TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Figure 4 for TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
Viaarxiv icon

SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning

Add code
Bookmark button
Alert button
May 24, 2023
Yue Wu, So Yeon Min, Shrimai Prabhumoye, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Tom Mitchell, Yuanzhi Li

Figure 1 for SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning
Figure 2 for SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning
Figure 3 for SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning
Figure 4 for SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning
Viaarxiv icon

Physics of Language Models: Part 1, Context-Free Grammar

Add code
Bookmark button
Alert button
May 23, 2023
Zeyuan Allen-Zhu, Yuanzhi Li

Figure 1 for Physics of Language Models: Part 1, Context-Free Grammar
Figure 2 for Physics of Language Models: Part 1, Context-Free Grammar
Figure 3 for Physics of Language Models: Part 1, Context-Free Grammar
Figure 4 for Physics of Language Models: Part 1, Context-Free Grammar
Viaarxiv icon

The probability flow ODE is provably fast

Add code
Bookmark button
Alert button
May 19, 2023
Sitan Chen, Sinho Chewi, Holden Lee, Yuanzhi Li, Jianfeng Lu, Adil Salim

Figure 1 for The probability flow ODE is provably fast
Figure 2 for The probability flow ODE is provably fast
Viaarxiv icon

Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents

Add code
Bookmark button
Alert button
May 07, 2023
Yue Wu, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Yuanzhi Li, Tom Mitchell, Shrimai Prabhumoye

Figure 1 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Figure 2 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Figure 3 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Figure 4 for Plan, Eliminate, and Track -- Language Models are Good Teachers for Embodied Agents
Viaarxiv icon

Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality

Add code
Bookmark button
Alert button
May 04, 2023
Dhruv Malik, Conor Igoe, Yuanzhi Li, Aarti Singh

Figure 1 for Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality
Figure 2 for Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality
Figure 3 for Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality
Figure 4 for Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality
Viaarxiv icon