Picture for Yanrui Wu

Yanrui Wu

GeoLaux: A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines

Add code
Aug 08, 2025
Viaarxiv icon

PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning

Add code
Feb 17, 2025
Figure 1 for PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning
Figure 2 for PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning
Figure 3 for PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning
Figure 4 for PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning
Viaarxiv icon

DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams

Add code
Nov 26, 2024
Figure 1 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Figure 2 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Figure 3 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Figure 4 for DiagramQG: A Dataset for Generating Concept-Focused Questions from Diagrams
Viaarxiv icon