Picture for Yusheng Li

Yusheng Li

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Add code
Jun 18, 2026
Viaarxiv icon

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Add code
Feb 11, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon