Kang An

Step-DeepResearch Technical Report

Dec 24, 2025
Via arXiv

Step-GUI Technical Report

Dec 19, 2025
Via arXiv

MMRPT: MultiModal Reinforcement Pre-Training via Masked Vision-Dependent Reasoning

Dec 08, 2025
Via arXiv

Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs

Oct 01, 2025
Via arXiv

Step-Audio 2 Technical Report

Jul 24, 2025
Via arXiv

StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization

May 21, 2025
Via arXiv

ASGO: Adaptive Structured Gradient Optimization

Mar 26, 2025
Via arXiv

Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model

Mar 14, 2025
Via arXiv

Geometry-Constrained Monocular Scale Estimation Using Semantic Segmentation for Dynamic Scenes

Mar 06, 2025
Via arXiv

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025
Via arXiv