Zexi Li

WorldLines: Benchmarking and Modeling Long-Horizon Stateful Embodied Agents

Jun 17, 2026
Via arXiv

Sparsity Curse: Understanding RLVR Model Parameter Space from Model Merging

Jun 16, 2026
Via arXiv

AnyEdit++: Adaptive Long-Form Knowledge Editing via Bayesian Surprise

May 31, 2026
Via arXiv

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Feb 11, 2026
Via arXiv

IntentRL: Training Proactive User-intent Agents for Open-ended Deep Research via Reinforcement Learning

Feb 03, 2026
Via arXiv

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Jan 07, 2026
Via arXiv

Step-DeepResearch Technical Report

Dec 24, 2025
Via arXiv

FedEve: On Bridging the Client Drift and Period Drift for Cross-device Federated Learning

Aug 20, 2025
Via arXiv

AdaFusion: Prompt-Guided Inference with Adaptive Fusion of Pathology Foundation Models

Aug 07, 2025
Via arXiv

Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?

May 26, 2025
Via arXiv