Picture for Bowen Yu

Bowen Yu

additional authors not shown

Reinforcement learning-guided optimization of critical current in high-temperature superconductors

Add code
Oct 25, 2025
Viaarxiv icon

Qwen3Guard Technical Report

Add code
Oct 16, 2025
Viaarxiv icon

A Multi-Agent System for Information Extraction from the Chemical Literature

Add code
Jul 27, 2025
Viaarxiv icon

RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Add code
Jul 27, 2025
Viaarxiv icon

Group Sequence Policy Optimization

Add code
Jul 24, 2025
Viaarxiv icon

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code

Add code
Jul 10, 2025
Viaarxiv icon

MARGE: Improving Math Reasoning for LLMs with Guided Exploration

Add code
May 18, 2025
Figure 1 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Figure 2 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Figure 3 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Figure 4 for MARGE: Improving Math Reasoning for LLMs with Guided Exploration
Viaarxiv icon

WorldPM: Scaling Human Preference Modeling

Add code
May 15, 2025
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Figure 1 for Qwen3 Technical Report
Figure 2 for Qwen3 Technical Report
Figure 3 for Qwen3 Technical Report
Figure 4 for Qwen3 Technical Report
Viaarxiv icon

START: Self-taught Reasoner with Tools

Add code
Mar 07, 2025
Viaarxiv icon