Yixin Cao

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Jul 08, 2025, via arXiv

Com$^2$: A Causal-Guided Benchmark for Exploring Complex Commonsense Reasoning in Large Language Models

Jun 08, 2025, via arXiv

Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings

Jun 05, 2025, via arXiv

Long or short CoT? Investigating Instance-level Switch of Large Reasoning Models

Jun 04, 2025, via arXiv

Disentangling Language and Culture for Evaluating Multilingual Large Language Models

May 30, 2025, via arXiv

FRAbench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities

May 19, 2025, via arXiv

Teach2Eval: An Indirect Evaluation Method for LLM by Judging How It Teaches

May 18, 2025, via arXiv

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

Apr 26, 2025, via arXiv

Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law

Apr 10, 2025, via arXiv

Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs

Mar 03, 2025, via arXiv