Picture for Ruosi Wan

Ruosi Wan

StepFun-Prover Preview: Let's Think and Verify Step by Step

Add code
Jul 27, 2025
Viaarxiv icon

Spherical Motion Dynamics of Deep Neural Networks with Batch Normalization and Weight Decay

Add code
Jul 02, 2020
Figure 1 for Spherical Motion Dynamics of Deep Neural Networks with Batch Normalization and Weight Decay
Figure 2 for Spherical Motion Dynamics of Deep Neural Networks with Batch Normalization and Weight Decay
Figure 3 for Spherical Motion Dynamics of Deep Neural Networks with Batch Normalization and Weight Decay
Figure 4 for Spherical Motion Dynamics of Deep Neural Networks with Batch Normalization and Weight Decay
Viaarxiv icon

Angle-based Search Space Shrinking for Neural Architecture Search

Add code
May 01, 2020
Figure 1 for Angle-based Search Space Shrinking for Neural Architecture Search
Figure 2 for Angle-based Search Space Shrinking for Neural Architecture Search
Figure 3 for Angle-based Search Space Shrinking for Neural Architecture Search
Figure 4 for Angle-based Search Space Shrinking for Neural Architecture Search
Viaarxiv icon

Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization

Add code
Jan 19, 2020
Figure 1 for Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
Figure 2 for Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
Figure 3 for Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
Figure 4 for Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization
Viaarxiv icon

Towards Making Deep Transfer Learning Never Hurt

Add code
Nov 18, 2019
Figure 1 for Towards Making Deep Transfer Learning Never Hurt
Figure 2 for Towards Making Deep Transfer Learning Never Hurt
Figure 3 for Towards Making Deep Transfer Learning Never Hurt
Figure 4 for Towards Making Deep Transfer Learning Never Hurt
Viaarxiv icon

Neural Control Variates for Variance Reduction

Add code
Jun 01, 2018
Figure 1 for Neural Control Variates for Variance Reduction
Figure 2 for Neural Control Variates for Variance Reduction
Figure 3 for Neural Control Variates for Variance Reduction
Viaarxiv icon