Picture for Aonan Zhang

Aonan Zhang

Synthetic bootstrapped pretraining

Add code
Sep 17, 2025
Viaarxiv icon

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Add code
Oct 02, 2024
Figure 1 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 2 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 3 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 4 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon

Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

Add code
May 23, 2024
Figure 1 for Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training
Figure 2 for Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training
Figure 3 for Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training
Figure 4 for Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training
Viaarxiv icon

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Add code
Mar 22, 2024
Figure 1 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 2 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 3 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Figure 4 for MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Viaarxiv icon

Recurrent Drafter for Fast Speculative Decoding in Large Language Models

Add code
Mar 22, 2024
Figure 1 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Figure 2 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Figure 3 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Figure 4 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Viaarxiv icon

Divide-or-Conquer? Which Part Should You Distill Your LLM?

Add code
Feb 22, 2024
Figure 1 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Figure 2 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Figure 3 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Figure 4 for Divide-or-Conquer? Which Part Should You Distill Your LLM?
Viaarxiv icon

Graph-Based Model-Agnostic Data Subsampling for Recommendation Systems

Add code
May 25, 2023
Viaarxiv icon

NVDiff: Graph Generation through the Diffusion of Node Vectors

Add code
Nov 19, 2022
Figure 1 for NVDiff: Graph Generation through the Diffusion of Node Vectors
Figure 2 for NVDiff: Graph Generation through the Diffusion of Node Vectors
Figure 3 for NVDiff: Graph Generation through the Diffusion of Node Vectors
Figure 4 for NVDiff: Graph Generation through the Diffusion of Node Vectors
Viaarxiv icon

Collaborative Anomaly Detection

Add code
Sep 20, 2022
Figure 1 for Collaborative Anomaly Detection
Figure 2 for Collaborative Anomaly Detection
Figure 3 for Collaborative Anomaly Detection
Figure 4 for Collaborative Anomaly Detection
Viaarxiv icon