Zhenru Zhang

START: Self-taught Reasoner with Tools

Mar 07, 2025
Via arXiv

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Jan 13, 2025
Via arXiv

Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning

Dec 19, 2024
Via arXiv

Qwen2.5 Technical Report

Dec 19, 2024
Via arXiv

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Dec 10, 2024
Via arXiv

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Sep 18, 2024
Via arXiv

Qwen2 Technical Report

Jul 16, 2024
Via arXiv

Qwen Technical Report

Sep 28, 2023
Via arXiv

Contrastive Demonstration Tuning for Pre-trained Language Models

Apr 18, 2022
Via arXiv

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Jan 24, 2022
Via arXiv