Zhaoxiang Zhang

AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark

Apr 27, 2026

GoClick: Lightweight Element Grounding Model for Autonomous GUI Interaction

Apr 27, 2026

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Apr 20, 2026

CodeTracer: Towards Traceable Agent States

Apr 14, 2026

ReinDriveGen: Reinforcement Post-Training for Out-of-Distribution Driving Scene Generation

Apr 01, 2026

DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving

Mar 11, 2026

GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation

Feb 24, 2026

FeatureBench: Benchmarking Agentic Coding for Complex Feature Development

Feb 11, 2026

WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models

Feb 09, 2026

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

Jan 01, 2026