Picture for Yixin Cao

Yixin Cao

What Do LLM Agents Know About Their World? Task2Quiz: A Paradigm for Studying Environment Understanding

Add code
Jan 14, 2026
Viaarxiv icon

ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging

Add code
Jan 12, 2026
Viaarxiv icon

SCALER:Synthetic Scalable Adaptive Learning Environment for Reasoning

Add code
Jan 08, 2026
Viaarxiv icon

Do LLMs Signal When They're Right? Evidence from Neuron Agreement

Add code
Oct 30, 2025
Viaarxiv icon

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Add code
Jul 08, 2025
Figure 1 for CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
Figure 2 for CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
Figure 3 for CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
Figure 4 for CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
Viaarxiv icon

Com$^2$: A Causal-Guided Benchmark for Exploring Complex Commonsense Reasoning in Large Language Models

Add code
Jun 08, 2025
Viaarxiv icon

Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings

Add code
Jun 05, 2025
Viaarxiv icon

Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models

Add code
Jun 04, 2025
Viaarxiv icon

Disentangling Language and Culture for Evaluating Multilingual Large Language Models

Add code
May 30, 2025
Viaarxiv icon

FRAbench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities

Add code
May 19, 2025
Viaarxiv icon