Beichen Zhang

Qwen3 Technical Report

May 14, 2025

Slow Thinking for Sequential Recommendation

Apr 13, 2025

START: Self-taught Reasoner with Tools

Mar 07, 2025

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Mar 06, 2025

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Jan 13, 2025

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Jan 06, 2025

Qwen2.5 Technical Report

Dec 19, 2024

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Dec 10, 2024

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Sep 18, 2024

Downstream-Pretext Domain Knowledge Traceback for Active Learning

Jul 20, 2024