Picture for Hongyang Wei

Hongyang Wei

Learning Cross-View Object Correspondence via Cycle-Consistent Mask Prediction

Add code
Feb 22, 2026
Viaarxiv icon

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Add code
Feb 15, 2026
Viaarxiv icon

Spatial Chain-of-Thought: Bridging Understanding and Generation Models for Spatial Reasoning Generation

Add code
Feb 12, 2026
Viaarxiv icon

SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning

Add code
Feb 07, 2026
Viaarxiv icon

Joint Reward Modeling: Internalizing Chain-of-Thought for Efficient Visual Reward Models

Add code
Feb 07, 2026
Viaarxiv icon

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Add code
Jan 29, 2026
Viaarxiv icon

Skywork UniPic 3.0: Unified Multi-Image Composition via Sequence Modeling

Add code
Jan 22, 2026
Viaarxiv icon

MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition

Add code
Dec 08, 2025
Figure 1 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Figure 2 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Figure 3 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Figure 4 for MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition
Viaarxiv icon

Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model

Add code
Sep 04, 2025
Figure 1 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Figure 2 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Figure 3 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Figure 4 for Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model
Viaarxiv icon

Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models

Add code
Mar 14, 2025
Figure 1 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Figure 2 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Figure 3 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Figure 4 for Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Viaarxiv icon