Picture for Diyi Yang

Diyi Yang

Stanford University

The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas

Add code
Jun 25, 2025
Viaarxiv icon

StorySage: Conversational Autobiography Writing Powered by a Multi-Agent Framework

Add code
Jun 17, 2025
Viaarxiv icon

When Large Language Models are Reliable for Judging Empathic Communication

Add code
Jun 11, 2025
Viaarxiv icon

Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce

Add code
Jun 06, 2025
Viaarxiv icon

SynthesizeMe! Inducing Persona-Guided Prompts for Personalized Reward Models in LLMs

Add code
Jun 05, 2025
Viaarxiv icon

When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration

Add code
Jun 05, 2025
Viaarxiv icon

Creating General User Models from Computer Use

Add code
May 19, 2025
Viaarxiv icon

Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors

Add code
May 17, 2025
Viaarxiv icon

AutoLibra: Agent Metric Induction from Open-Ended Feedback

Add code
May 05, 2025
Viaarxiv icon

SWE-smith: Scaling Data for Software Engineering Agents

Add code
Apr 30, 2025
Viaarxiv icon