Picture for Jianjian Sun

Jianjian Sun

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Figure 1 for Step-Audio 2 Technical Report
Figure 2 for Step-Audio 2 Technical Report
Figure 3 for Step-Audio 2 Technical Report
Figure 4 for Step-Audio 2 Technical Report
Viaarxiv icon

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Add code
Jun 10, 2025
Figure 1 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Figure 2 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Figure 3 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Figure 4 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Viaarxiv icon

Perception-R1: Pioneering Perception Policy with Reinforcement Learning

Add code
Apr 10, 2025
Figure 1 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Figure 2 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Figure 3 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Figure 4 for Perception-R1: Pioneering Perception Policy with Reinforcement Learning
Viaarxiv icon

Perception in Reflection

Add code
Apr 09, 2025
Figure 1 for Perception in Reflection
Figure 2 for Perception in Reflection
Figure 3 for Perception in Reflection
Figure 4 for Perception in Reflection
Viaarxiv icon

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Add code
Feb 18, 2025
Viaarxiv icon

Unhackable Temporal Rewarding for Scalable Video MLLMs

Add code
Feb 17, 2025
Figure 1 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Figure 2 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Figure 3 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Figure 4 for Unhackable Temporal Rewarding for Scalable Video MLLMs
Viaarxiv icon

PerPO: Perceptual Preference Optimization via Discriminative Rewarding

Add code
Feb 05, 2025
Figure 1 for PerPO: Perceptual Preference Optimization via Discriminative Rewarding
Figure 2 for PerPO: Perceptual Preference Optimization via Discriminative Rewarding
Figure 3 for PerPO: Perceptual Preference Optimization via Discriminative Rewarding
Figure 4 for PerPO: Perceptual Preference Optimization via Discriminative Rewarding
Viaarxiv icon

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Add code
Dec 30, 2024
Figure 1 for Slow Perception: Let's Perceive Geometric Figures Step-by-step
Figure 2 for Slow Perception: Let's Perceive Geometric Figures Step-by-step
Figure 3 for Slow Perception: Let's Perceive Geometric Figures Step-by-step
Figure 4 for Slow Perception: Let's Perceive Geometric Figures Step-by-step
Viaarxiv icon

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Add code
Sep 03, 2024
Figure 1 for General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Figure 2 for General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Figure 3 for General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Figure 4 for General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Viaarxiv icon