Picture for Kaipeng Zhang

Kaipeng Zhang

JAMER: Project-Level Code Framework Dataset and Benchmark on Professional Game Engines

Add code
Jun 18, 2026
Viaarxiv icon

ComAct: Reframing Professional Software Manipulation via COM-as-Action Paradigm

Add code
Jun 11, 2026
Viaarxiv icon

Faithful, Enriched, and Precise: Benchmarking Natural-Science Illustration Generation by T2I models

Add code
Jun 04, 2026
Viaarxiv icon

YoCausal: How Far is Video Generation from World Model? A Causality Perspective

Add code
May 28, 2026
Viaarxiv icon

Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality?

Add code
May 21, 2026
Viaarxiv icon

WorldMark: A Unified Benchmark Suite for Interactive Video World Models

Add code
Apr 23, 2026
Viaarxiv icon

Generative World Renderer

Add code
Apr 02, 2026
Viaarxiv icon

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Add code
Mar 26, 2026
Viaarxiv icon

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Add code
Mar 24, 2026
Viaarxiv icon

PyVision-RL: Forging Open Agentic Vision Models via RL

Add code
Feb 24, 2026
Viaarxiv icon