Picture for Kai Zou

Kai Zou

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

Add code
May 20, 2026
Viaarxiv icon

Advancing Aesthetic Image Generation via Composition Transfer

Add code
May 06, 2026
Viaarxiv icon

MMEB-V3: Measuring the Performance Gaps of Omni-Modality Embedding Models

Add code
Apr 25, 2026
Viaarxiv icon

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Add code
Apr 09, 2026
Viaarxiv icon

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

Add code
Mar 21, 2026
Viaarxiv icon

EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding

Add code
Mar 18, 2026
Viaarxiv icon

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

Add code
Mar 09, 2026
Viaarxiv icon

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Add code
Feb 10, 2026
Viaarxiv icon

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Add code
Jun 04, 2025
Viaarxiv icon

Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model

Add code
May 27, 2025
Viaarxiv icon