Picture for Chujie Zheng

Chujie Zheng

WorldPM: Scaling Human Preference Modeling

Add code
May 15, 2025
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

Aligning Instruction Tuning with Pre-training

Add code
Jan 16, 2025
Figure 1 for Aligning Instruction Tuning with Pre-training
Figure 2 for Aligning Instruction Tuning with Pre-training
Figure 3 for Aligning Instruction Tuning with Pre-training
Figure 4 for Aligning Instruction Tuning with Pre-training
Viaarxiv icon

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Add code
Jan 13, 2025
Figure 1 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Figure 2 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Figure 3 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Figure 4 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Viaarxiv icon

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Add code
Dec 10, 2024
Figure 1 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Figure 2 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Figure 3 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Figure 4 for ProcessBench: Identifying Process Errors in Mathematical Reasoning
Viaarxiv icon

Yi-Lightning Technical Report

Add code
Dec 03, 2024
Figure 1 for Yi-Lightning Technical Report
Figure 2 for Yi-Lightning Technical Report
Figure 3 for Yi-Lightning Technical Report
Figure 4 for Yi-Lightning Technical Report
Viaarxiv icon

Semantic Search Evaluation

Add code
Oct 28, 2024
Figure 1 for Semantic Search Evaluation
Figure 2 for Semantic Search Evaluation
Figure 3 for Semantic Search Evaluation
Figure 4 for Semantic Search Evaluation
Viaarxiv icon

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Add code
Jul 03, 2024
Figure 1 for Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
Figure 2 for Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
Figure 3 for Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
Figure 4 for Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
Viaarxiv icon

Weak-to-Strong Extrapolation Expedites Alignment

Add code
Apr 25, 2024
Figure 1 for Weak-to-Strong Extrapolation Expedites Alignment
Figure 2 for Weak-to-Strong Extrapolation Expedites Alignment
Figure 3 for Weak-to-Strong Extrapolation Expedites Alignment
Figure 4 for Weak-to-Strong Extrapolation Expedites Alignment
Viaarxiv icon