Image


StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets

Add code
Jun 09, 2025
Viaarxiv icon

Dreamland: Controllable World Creation with Simulator and Generative Models

Add code
Jun 09, 2025
Viaarxiv icon

Aligning Text, Images, and 3D Structure Token-by-Token

Add code
Jun 09, 2025
Viaarxiv icon

MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation

Add code
Jun 09, 2025
Viaarxiv icon

Generative Modeling of Weights: Generalization or Memorization?

Add code
Jun 09, 2025
Viaarxiv icon

UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References

Add code
Jun 09, 2025
Viaarxiv icon

PairEdit: Learning Semantic Variations for Exemplar-based Image Editing

Add code
Jun 09, 2025
Viaarxiv icon

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Add code
Jun 09, 2025
Viaarxiv icon

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Add code
Jun 09, 2025
Viaarxiv icon

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design

Add code
Jun 09, 2025
Viaarxiv icon