Benteng Chen

PolyReal: A Benchmark for Real-World Polymer Science Workflows

Apr 03, 2026
Via arXiv

Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

Oct 02, 2025
Via arXiv

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Aug 25, 2025
Via arXiv

DSADF: Thinking Fast and Slow for Decision Making

May 13, 2025
Via arXiv