Boxian Ai

OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents

May 28, 2026, via arXiv

DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model

Feb 27, 2026, via arXiv