Picture for Daniel Fried

Daniel Fried

How Well Does Agent Development Reflect Real-World Work?

Add code
Mar 01, 2026
Viaarxiv icon

Hybrid-Gym: Training Coding Agents to Generalize Across Tasks

Add code
Feb 18, 2026
Viaarxiv icon

Reasoning with Latent Tokens in Diffusion Language Models

Add code
Feb 03, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Toward Training Superintelligent Software Agents through Self-Play SWE-RL

Add code
Dec 21, 2025
Viaarxiv icon

Propose, Solve, Verify: Self-Play Through Formal Verification

Add code
Dec 20, 2025
Figure 1 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 2 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 3 for Propose, Solve, Verify: Self-Play Through Formal Verification
Figure 4 for Propose, Solve, Verify: Self-Play Through Formal Verification
Viaarxiv icon

Measuring Fine-Grained Negotiation Tactics of Humans and LLMs in Diplomacy

Add code
Dec 20, 2025
Viaarxiv icon

How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations

Add code
Oct 26, 2025
Viaarxiv icon

Identifying & Interactively Refining Ambiguous User Goals for Data Visualization Code Generation

Add code
Oct 10, 2025
Viaarxiv icon

MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization

Add code
Jul 15, 2025
Figure 1 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Figure 2 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Figure 3 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Figure 4 for MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization
Viaarxiv icon