Picture for Sharon Li

Sharon Li

Auditing Agent Harness Safety

Add code
May 14, 2026
Viaarxiv icon

Why Multimodal In-Context Learning Lags Behind? Unveiling the Inner Mechanisms and Bottlenecks

Add code
Apr 15, 2026
Viaarxiv icon

TorchUMM: A Unified Multimodal Model Codebase for Evaluation, Analysis, and Post-training

Add code
Apr 12, 2026
Viaarxiv icon

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Add code
Apr 05, 2026
Viaarxiv icon

Combating Data Laundering in LLM Training

Add code
Apr 02, 2026
Viaarxiv icon

VAUQ: Vision-Aware Uncertainty Quantification for LVLM Self-Evaluation

Add code
Feb 24, 2026
Viaarxiv icon

LAD: Learning Advantage Distribution for Reasoning

Add code
Feb 23, 2026
Viaarxiv icon

How Retrieved Context Shapes Internal Representations in RAG

Add code
Feb 23, 2026
Viaarxiv icon

Thinking Makes LLM Agents Introverted: How Mandatory Thinking Can Backfire in User-Engaged Agents

Add code
Feb 08, 2026
Viaarxiv icon

Towards Reducible Uncertainty Modeling for Reliable Large Language Model Agents

Add code
Feb 04, 2026
Viaarxiv icon