Picture for Yunnan Wang

Yunnan Wang

Vision-Centric Activation and Coordination for Multimodal Large Language Models

Add code
Oct 16, 2025
Viaarxiv icon

Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation

Add code
Oct 01, 2024
Figure 1 for Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
Figure 2 for Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
Figure 3 for Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
Figure 4 for Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation
Viaarxiv icon

Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models

Add code
Jul 26, 2024
Viaarxiv icon

One at A Time: Multi-step Volumetric Probability Distribution Diffusion for Depth Estimation

Add code
Jul 07, 2023
Viaarxiv icon