Picture for Hongbo Peng

Hongbo Peng

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Add code
Mar 11, 2026
Viaarxiv icon

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Add code
Mar 09, 2026
Viaarxiv icon

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Add code
Feb 11, 2026
Viaarxiv icon

GEBench: Benchmarking Image Generation Models as GUI Environments

Add code
Feb 09, 2026
Viaarxiv icon

STEP3-VL-10B Technical Report

Add code
Jan 15, 2026
Viaarxiv icon

A Machine Learning-Based Multimodal Framework for Wearable Sensor-Based Archery Action Recognition and Stress Estimation

Add code
Nov 18, 2025
Viaarxiv icon