Alert button
Picture for Yuanzhi Li

Yuanzhi Li

Alert button

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Mar 01, 2024
Ruiqian Nai, Zixin Wen, Ji Li, Yuanzhi Li, Yang Gao

Figure 1 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Figure 2 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Figure 3 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Figure 4 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Viaarxiv icon

Provably learning a multi-head attention layer

Feb 06, 2024
Sitan Chen, Yuanzhi Li

Viaarxiv icon

TinyGSM: achieving >80% on GSM8k with small language models

Dec 14, 2023
Bingbin Liu, Sebastien Bubeck, Ronen Eldan, Janardhan Kulkarni, Yuanzhi Li, Anh Nguyen, Rachel Ward, Yi Zhang

Viaarxiv icon

Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

Nov 28, 2023
Harsha Nori, Yin Tat Lee, Sheng Zhang, Dean Carignan, Richard Edgar, Nicolo Fusi, Nicholas King, Jonathan Larson, Yuanzhi Li, Weishung Liu, Renqian Luo, Scott Mayer McKinney, Robert Osazuwa Ness, Hoifung Poon, Tao Qin, Naoto Usuyama, Chris White, Eric Horvitz

Viaarxiv icon

Positional Description Matters for Transformers Arithmetic

Nov 22, 2023
Ruoqi Shen, Sébastien Bubeck, Ronen Eldan, Yin Tat Lee, Yuanzhi Li, Yi Zhang

Viaarxiv icon

Simple Mechanisms for Representing, Indexing and Manipulating Concepts

Oct 18, 2023
Yuanzhi Li, Raghu Meka, Rina Panigrahy, Kulin Shah

Viaarxiv icon

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Oct 04, 2023
Yue Wu, Xuan Tang, Tom M. Mitchell, Yuanzhi Li

Figure 1 for SmartPlay : A Benchmark for LLMs as Intelligent Agents
Figure 2 for SmartPlay : A Benchmark for LLMs as Intelligent Agents
Figure 3 for SmartPlay : A Benchmark for LLMs as Intelligent Agents
Figure 4 for SmartPlay : A Benchmark for LLMs as Intelligent Agents
Viaarxiv icon

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Oct 02, 2023
Zixiang Chen, Yihe Deng, Yuanzhi Li, Quanquan Gu

Figure 1 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Figure 2 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Figure 3 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Figure 4 for Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
Viaarxiv icon

Physics of Language Models: Part 3.2, Knowledge Manipulation

Sep 25, 2023
Zeyuan Allen-Zhu, Yuanzhi Li

Figure 1 for Physics of Language Models: Part 3.2, Knowledge Manipulation
Figure 2 for Physics of Language Models: Part 3.2, Knowledge Manipulation
Figure 3 for Physics of Language Models: Part 3.2, Knowledge Manipulation
Figure 4 for Physics of Language Models: Part 3.2, Knowledge Manipulation
Viaarxiv icon

Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Sep 25, 2023
Zeyuan Allen Zhu, Yuanzhi Li

Figure 1 for Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Figure 2 for Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Figure 3 for Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Figure 4 for Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Viaarxiv icon