Shufei Zhang

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Dec 30, 2025

An Agentic Framework for Autonomous Materials Computation

Dec 22, 2025

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Dec 18, 2025

Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

Oct 02, 2025

ChemBOMAS: Accelerated BO in Chemistry with LLM-Enhanced Multi-Agent System

Sep 10, 2025

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Aug 25, 2025

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Aug 11, 2025

Iterative Pretraining Framework for Interatomic Potentials

Jul 27, 2025

Multiphysics Bench: Benchmarking and Investigating Scientific Machine Learning for Multiphysics PDEs

May 23, 2025

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

May 22, 2025