
Yuanzhi Li

TinyGSM: achieving >80% on GSM8k with small language models

Dec 14, 2023

Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

Nov 28, 2023

Positional Description Matters for Transformers Arithmetic

Nov 22, 2023

Simple Mechanisms for Representing, Indexing and Manipulating Concepts

Oct 18, 2023

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Oct 04, 2023

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Oct 02, 2023

Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Sep 25, 2023

Physics of Language Models: Part 3.2, Knowledge Manipulation

Sep 25, 2023

Textbooks Are All You Need II: phi-1.5 technical report

Sep 11, 2023

Efficient RLHF: Reducing the Memory Usage of PPO

Sep 01, 2023