Lijun Li

SEARL: Joint Optimization of Policy and Tool Graph Memory for Self-Evolving Agents

Apr 09, 2026
Via arXiv

Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning

Mar 30, 2026
Via arXiv

TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration

Mar 24, 2026
Via arXiv

Stable Adaptive Thinking via Advantage Shaping and Length-Aware Gradient Regulation

Feb 26, 2026
Via arXiv

TabSieve: Explicit In-Table Evidence Selection for Tabular Prediction

Feb 12, 2026
Via arXiv

DeepSight: An All-in-One LM Safety Toolkit

Feb 12, 2026
Via arXiv

ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning

Feb 10, 2026
Via arXiv

Toward Efficient Agents: Memory, Tool learning, and Planning

Jan 20, 2026
Via arXiv

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Jan 15, 2026
Via arXiv

ProGuard: Towards Proactive Multimodal Safeguard

Dec 29, 2025
Via arXiv