Picture for Dongzhan Zhou

Dongzhan Zhou

MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models

Add code
Dec 24, 2025
Viaarxiv icon

An Agentic Framework for Autonomous Materials Computation

Add code
Dec 22, 2025
Viaarxiv icon

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Add code
Dec 18, 2025
Viaarxiv icon

Single-Agent Scaling Fails Multi-Agent Intelligence: Towards Foundation Models with Native Multi-Agent Intelligence

Add code
Dec 16, 2025
Viaarxiv icon

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Add code
Nov 19, 2025
Viaarxiv icon

P1: Mastering Physics Olympiads with Reinforcement Learning

Add code
Nov 17, 2025
Viaarxiv icon

AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials

Add code
Oct 06, 2025
Figure 1 for AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials
Figure 2 for AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials
Figure 3 for AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials
Figure 4 for AtomWorld: A Benchmark for Evaluating Spatial Reasoning in Large Language Models on Crystalline Materials
Viaarxiv icon

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Add code
Sep 10, 2025
Viaarxiv icon

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Add code
Sep 10, 2025
Figure 1 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 2 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 3 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 4 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Viaarxiv icon

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Figure 1 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 2 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 3 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Figure 4 for A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Viaarxiv icon