Picture for Yingxiu Zhao

Yingxiu Zhao

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

Add code
Apr 15, 2026
Viaarxiv icon

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Add code
Mar 11, 2026
Viaarxiv icon

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Add code
Feb 11, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon

Step-GUI Technical Report

Add code
Dec 19, 2025
Figure 1 for Step-GUI Technical Report
Figure 2 for Step-GUI Technical Report
Figure 3 for Step-GUI Technical Report
Figure 4 for Step-GUI Technical Report
Viaarxiv icon

GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning

Add code
Apr 17, 2025
Figure 1 for GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Figure 2 for GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Figure 3 for GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Figure 4 for GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Viaarxiv icon

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games

Add code
Mar 12, 2025
Figure 1 for CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
Figure 2 for CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
Figure 3 for CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
Figure 4 for CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games
Viaarxiv icon

ChineseSimpleVQA -- "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models

Add code
Feb 19, 2025
Viaarxiv icon

Automatic Instruction Evolving for Large Language Models

Add code
Jun 02, 2024
Figure 1 for Automatic Instruction Evolving for Large Language Models
Figure 2 for Automatic Instruction Evolving for Large Language Models
Figure 3 for Automatic Instruction Evolving for Large Language Models
Figure 4 for Automatic Instruction Evolving for Large Language Models
Viaarxiv icon

Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents

Add code
Mar 08, 2024
Figure 1 for Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
Figure 2 for Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
Figure 3 for Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
Figure 4 for Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents
Viaarxiv icon