Picture for Binyuan Hui

Binyuan Hui

additional authors not shown

WorldPM: Scaling Human Preference Modeling

Add code
May 15, 2025
Viaarxiv icon

Parallel Scaling Law for Language Models

Add code
May 15, 2025
Figure 1 for Parallel Scaling Law for Language Models
Figure 2 for Parallel Scaling Law for Language Models
Figure 3 for Parallel Scaling Law for Language Models
Figure 4 for Parallel Scaling Law for Language Models
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Figure 1 for Qwen3 Technical Report
Figure 2 for Qwen3 Technical Report
Figure 3 for Qwen3 Technical Report
Figure 4 for Qwen3 Technical Report
Viaarxiv icon

SWE-smith: Scaling Data for Software Engineering Agents

Add code
Apr 30, 2025
Viaarxiv icon

START: Self-taught Reasoner with Tools

Add code
Mar 07, 2025
Viaarxiv icon

Multi-Agent Collaboration for Multilingual Code Instruction Tuning

Add code
Feb 11, 2025
Viaarxiv icon

Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation

Add code
Feb 10, 2025
Viaarxiv icon

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Add code
Jan 03, 2025
Viaarxiv icon

Qwen2.5 Technical Report

Add code
Dec 19, 2024
Figure 1 for Qwen2.5 Technical Report
Figure 2 for Qwen2.5 Technical Report
Figure 3 for Qwen2.5 Technical Report
Figure 4 for Qwen2.5 Technical Report
Viaarxiv icon

ExecRepoBench: Multi-level Executable Code Completion Evaluation

Add code
Dec 16, 2024
Figure 1 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Figure 2 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Figure 3 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Figure 4 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Viaarxiv icon