Picture for Daniel Fried

Daniel Fried

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Toward Training Superintelligent Software Agents through Self-Play SWE-RL

Add code
Dec 21, 2025
Viaarxiv icon

Measuring Fine-Grained Negotiation Tactics of Humans and LLMs in Diplomacy

Add code
Dec 20, 2025
Viaarxiv icon

Propose, Solve, Verify: Self-Play Through Formal Verification

Add code
Dec 20, 2025
Figure 1 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 2 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 3 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 4 for Propose, Solve, Verify: Self-Play Through Formal Verification
Viaarxiv icon

How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations

Add code
Oct 26, 2025
Viaarxiv icon

Identifying & Interactively Refining Ambiguous User Goals for Data Visualization Code Generation

Add code
Oct 10, 2025
Viaarxiv icon

MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization

Add code
Jul 15, 2025
Figure 1 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Figure 2 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Figure 3 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Figure 4 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Viaarxiv icon

From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking

Add code
Jun 24, 2025
Viaarxiv icon

mrCAD: Multimodal Refinement of Computer-aided Designs

Add code
Apr 28, 2025
Figure 1 for mrCAD: Multimodal Refinement of Computer-aided Designs
Figure 2 for mrCAD: Multimodal Refinement of Computer-aided Designs
Figure 3 for mrCAD: Multimodal Refinement of Computer-aided Designs
Figure 4 for mrCAD: Multimodal Refinement of Computer-aided Designs
Viaarxiv icon

Inducing Programmatic Skills for Agentic Tasks

Add code
Apr 09, 2025
Figure 1 for Inducing Programmatic Skills for Agentic Tasks
Figure 2 for Inducing Programmatic Skills for Agentic Tasks
Figure 3 for Inducing Programmatic Skills for Agentic Tasks
Figure 4 for Inducing Programmatic Skills for Agentic Tasks
Viaarxiv icon