Jiaran Zhang

DockSmith: Scaling Reliable Coding Environments via an Agentic Docker Builder

Jan 31, 2026
Via arXiv

STEP3-VL-10B Technical Report

Jan 15, 2026
Via arXiv

Leash: Adaptive Length Penalty and Reward Shaping for Efficient Large Reasoning Model

Dec 25, 2025
Via arXiv