Picture for Hanwang Zhang

Hanwang Zhang

DragNeXt: Rethinking Drag-Based Image Editing

Add code
Jun 09, 2025
Viaarxiv icon

3D Question Answering via only 2D Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

Exploring Consciousness in LLMs: A Systematic Survey of Theories, Implementations, and Frontier Risks

Add code
May 26, 2025
Viaarxiv icon

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Add code
May 23, 2025
Viaarxiv icon

Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image

Add code
May 20, 2025
Viaarxiv icon

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Add code
May 18, 2025
Viaarxiv icon

Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Add code
May 12, 2025
Viaarxiv icon

On Path to Multimodal Generalist: General-Level and General-Bench

Add code
May 07, 2025
Viaarxiv icon

Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization

Add code
Apr 25, 2025
Viaarxiv icon

Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning

Add code
Apr 22, 2025
Viaarxiv icon