Picture for Hanwang Zhang

Hanwang Zhang

DEPO: Dual-Efficiency Preference Optimization for LLM Agents

Add code
Nov 19, 2025
Viaarxiv icon

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Add code
Nov 14, 2025
Viaarxiv icon

NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos

Add code
Nov 11, 2025
Viaarxiv icon

Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning

Add code
Jul 10, 2025
Viaarxiv icon

DragNeXt: Rethinking Drag-Based Image Editing

Add code
Jun 09, 2025
Viaarxiv icon

3D Question Answering via only 2D Vision-Language Models

Add code
May 28, 2025
Viaarxiv icon

Exploring Consciousness in LLMs: A Systematic Survey of Theories, Implementations, and Frontier Risks

Add code
May 26, 2025
Viaarxiv icon

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Add code
May 23, 2025
Viaarxiv icon

Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image

Add code
May 20, 2025
Viaarxiv icon

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Add code
May 18, 2025
Viaarxiv icon