Picture for Bowen Zhou

Bowen Zhou

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

Add code
Jun 23, 2026
Viaarxiv icon

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Add code
Jun 08, 2026
Viaarxiv icon

AMix-2: Establishing Protein as a Native Modality in Large Language Models

Add code
May 29, 2026
Viaarxiv icon

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Add code
May 28, 2026
Viaarxiv icon

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Add code
May 18, 2026
Viaarxiv icon

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Add code
May 13, 2026
Viaarxiv icon

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Add code
May 07, 2026
Viaarxiv icon

Earth-o1: A Grid-free Observation-native Atmospheric World Model

Add code
May 07, 2026
Viaarxiv icon

MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation

Add code
Apr 16, 2026
Viaarxiv icon

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Add code
Apr 06, 2026
Viaarxiv icon