Picture for Azalia Mirhoseini

Azalia Mirhoseini

Federation of Experts: Communication Efficient Distributed Inference for Large Language Models

Add code
May 07, 2026
Viaarxiv icon

TRACE: Capability-Targeted Agentic Training

Add code
Apr 07, 2026
Viaarxiv icon

AI+HW 2035: Shaping the Next Decade

Add code
Mar 05, 2026
Viaarxiv icon

Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment

Add code
Feb 12, 2026
Viaarxiv icon

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

Add code
Nov 14, 2025
Viaarxiv icon

Astra: A Multi-Agent System for GPU Kernel Performance Optimization

Add code
Sep 09, 2025
Viaarxiv icon

Cartridges: Lightweight and general-purpose long context representations via self-study

Add code
Jun 06, 2025
Figure 1 for Cartridges: Lightweight and general-purpose long context representations via self-study
Figure 2 for Cartridges: Lightweight and general-purpose long context representations via self-study
Figure 3 for Cartridges: Lightweight and general-purpose long context representations via self-study
Figure 4 for Cartridges: Lightweight and general-purpose long context representations via self-study
Viaarxiv icon

SPRINT: Enabling Interleaved Planning and Parallelized Execution in Reasoning Models

Add code
Jun 06, 2025
Viaarxiv icon

Exploring Diffusion Transformer Designs via Grafting

Add code
Jun 06, 2025
Viaarxiv icon

Think, Prune, Train, Improve: Scaling Reasoning without Scaling Models

Add code
Apr 25, 2025
Viaarxiv icon