Picture for Yixin Cao

Yixin Cao

Reinforcement Learning with Conditional Expectation Reward

Add code
Mar 11, 2026
Viaarxiv icon

GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Add code
Mar 02, 2026
Viaarxiv icon

DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model

Add code
Feb 27, 2026
Viaarxiv icon

NEX: Neuron Explore-Exploit Scoring for Label-Free Chain-of-Thought Selection and Model Ranking

Add code
Feb 05, 2026
Viaarxiv icon

CoDiQ: Test-Time Scaling for Controllable Difficult Question Generation

Add code
Feb 02, 2026
Viaarxiv icon

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

Add code
Feb 02, 2026
Viaarxiv icon

EMemBench: Interactive Benchmarking of Episodic Memory for VLM Agents

Add code
Jan 23, 2026
Viaarxiv icon

Thinking Traps in Long Chain-of-Thought: A Measurable Study and Trap-Aware Adaptive Restart

Add code
Jan 17, 2026
Viaarxiv icon

What Do LLM Agents Know About Their World? Task2Quiz: A Paradigm for Studying Environment Understanding

Add code
Jan 14, 2026
Viaarxiv icon

ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging

Add code
Jan 12, 2026
Viaarxiv icon