Picture for Dawn Song

Dawn Song

University of California, Berkeley

InfoSynth: Information-Guided Benchmark Synthesis for LLMs

Add code
Jan 02, 2026
Viaarxiv icon

How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns

Add code
Dec 30, 2025
Viaarxiv icon

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Add code
Dec 30, 2025
Viaarxiv icon

Adaptation of Agentic AI

Add code
Dec 22, 2025
Figure 1 for Adaptation of Agentic AI
Figure 2 for Adaptation of Agentic AI
Figure 3 for Adaptation of Agentic AI
Figure 4 for Adaptation of Agentic AI
Viaarxiv icon

FrontierCS: Evolving Challenges for Evolving Intelligence

Add code
Dec 17, 2025
Figure 1 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 2 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 3 for FrontierCS: Evolving Challenges for Evolving Intelligence
Figure 4 for FrontierCS: Evolving Challenges for Evolving Intelligence
Viaarxiv icon

VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection

Add code
Dec 08, 2025
Viaarxiv icon

Scaling Agent Learning via Experience Synthesis

Add code
Nov 10, 2025
Figure 1 for Scaling Agent Learning via Experience Synthesis
Figure 2 for Scaling Agent Learning via Experience Synthesis
Figure 3 for Scaling Agent Learning via Experience Synthesis
Figure 4 for Scaling Agent Learning via Experience Synthesis
Viaarxiv icon

VMDT: Decoding the Trustworthiness of Video Foundation Models

Add code
Nov 07, 2025
Viaarxiv icon

AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond

Add code
Sep 30, 2025
Viaarxiv icon

RepIt: Representing Isolated Targets to Steer Language Models

Add code
Sep 16, 2025
Viaarxiv icon