Picture for Christopher Ré

Christopher Ré

Department of Computer Science, Stanford University

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Add code
Nov 08, 2024
Figure 1 for Aioli: A Unified Optimization Framework for Language Model Data Mixing
Figure 2 for Aioli: A Unified Optimization Framework for Language Model Data Mixing
Figure 3 for Aioli: A Unified Optimization Framework for Language Model Data Mixing
Figure 4 for Aioli: A Unified Optimization Framework for Language Model Data Mixing
Viaarxiv icon

Scaling Laws for Precision

Add code
Nov 07, 2024
Viaarxiv icon

ThunderKittens: Simple, Fast, and Adorable AI Kernels

Add code
Oct 27, 2024
Viaarxiv icon

LoLCATs: On Low-Rank Linearizing of Large Language Models

Add code
Oct 14, 2024
Figure 1 for LoLCATs: On Low-Rank Linearizing of Large Language Models
Figure 2 for LoLCATs: On Low-Rank Linearizing of Large Language Models
Figure 3 for LoLCATs: On Low-Rank Linearizing of Large Language Models
Figure 4 for LoLCATs: On Low-Rank Linearizing of Large Language Models
Viaarxiv icon

Automated Rewards via LLM-Generated Progress Functions

Add code
Oct 11, 2024
Figure 1 for Automated Rewards via LLM-Generated Progress Functions
Figure 2 for Automated Rewards via LLM-Generated Progress Functions
Figure 3 for Automated Rewards via LLM-Generated Progress Functions
Figure 4 for Automated Rewards via LLM-Generated Progress Functions
Viaarxiv icon

Restructuring Vector Quantization with the Rotation Trick

Add code
Oct 08, 2024
Figure 1 for Restructuring Vector Quantization with the Rotation Trick
Figure 2 for Restructuring Vector Quantization with the Rotation Trick
Figure 3 for Restructuring Vector Quantization with the Rotation Trick
Figure 4 for Restructuring Vector Quantization with the Rotation Trick
Viaarxiv icon

Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates

Add code
Oct 07, 2024
Figure 1 for Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
Figure 2 for Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
Figure 3 for Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
Figure 4 for Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
Viaarxiv icon

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Add code
Jul 31, 2024
Figure 1 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Figure 2 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Figure 3 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Figure 4 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Viaarxiv icon

Just read twice: closing the recall gap for recurrent language models

Add code
Jul 07, 2024
Figure 1 for Just read twice: closing the recall gap for recurrent language models
Figure 2 for Just read twice: closing the recall gap for recurrent language models
Figure 3 for Just read twice: closing the recall gap for recurrent language models
Figure 4 for Just read twice: closing the recall gap for recurrent language models
Viaarxiv icon

State-Free Inference of State-Space Models: The Transfer Function Approach

Add code
May 10, 2024
Viaarxiv icon