Picture for Yiyou Sun

Yiyou Sun

The Piggyback Hypothesis of Generalization: Explaining and Mitigating Emergent Misalignment

Add code
Jun 04, 2026
Viaarxiv icon

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

The Long-Horizon Task Mirage? Diagnosing Where and Why Agentic Systems Break

Add code
Apr 13, 2026
Viaarxiv icon

Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance

Add code
Feb 26, 2026
Viaarxiv icon

Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents

Add code
Feb 13, 2026
Viaarxiv icon

How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns

Add code
Dec 30, 2025
Viaarxiv icon

MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them

Add code
Jul 28, 2025
Figure 1 for MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them
Figure 2 for MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them
Figure 3 for MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them
Figure 4 for MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them
Viaarxiv icon

OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization

Add code
Jun 23, 2025
Viaarxiv icon

Where's the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content

Add code
May 02, 2025
Viaarxiv icon

Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations

Add code
Apr 17, 2025
Figure 1 for Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Figure 2 for Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Figure 3 for Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Figure 4 for Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Viaarxiv icon