Picture for Aston Zhang

Aston Zhang

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs

Add code
Dec 15, 2025
Viaarxiv icon

A Systematic Examination of Preference Learning through the Lens of Instruction-Following

Add code
Dec 18, 2024
Viaarxiv icon

Self-Generated Critiques Boost Reward Modeling for Language Models

Add code
Nov 25, 2024
Figure 1 for Self-Generated Critiques Boost Reward Modeling for Language Models
Figure 2 for Self-Generated Critiques Boost Reward Modeling for Language Models
Figure 3 for Self-Generated Critiques Boost Reward Modeling for Language Models
Figure 4 for Self-Generated Critiques Boost Reward Modeling for Language Models
Viaarxiv icon

Law of the Weakest Link: Cross Capabilities of Large Language Models

Add code
Sep 30, 2024
Figure 1 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 2 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 3 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 4 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Viaarxiv icon

Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions

Add code
Aug 05, 2024
Figure 1 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Figure 2 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Figure 3 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Figure 4 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

Add code
Nov 20, 2023
Figure 1 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Figure 2 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Figure 3 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Figure 4 for Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
Viaarxiv icon

In-Context Learning with Iterative Demonstration Selection

Add code
Oct 22, 2023
Figure 1 for In-Context Learning with Iterative Demonstration Selection
Figure 2 for In-Context Learning with Iterative Demonstration Selection
Figure 3 for In-Context Learning with Iterative Demonstration Selection
Figure 4 for In-Context Learning with Iterative Demonstration Selection
Viaarxiv icon

You Only Look at Screens: Multimodal Chain-of-Action Agents

Add code
Sep 21, 2023
Figure 1 for You Only Look at Screens: Multimodal Chain-of-Action Agents
Figure 2 for You Only Look at Screens: Multimodal Chain-of-Action Agents
Figure 3 for You Only Look at Screens: Multimodal Chain-of-Action Agents
Figure 4 for You Only Look at Screens: Multimodal Chain-of-Action Agents
Viaarxiv icon