Picture for Cunjian Chen

Cunjian Chen

IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation

Add code
Dec 29, 2025
Viaarxiv icon

AsyncDiff: Asynchronous Timestep Conditioning for Enhanced Text-to-Image Diffusion Inference

Add code
Dec 21, 2025
Viaarxiv icon

SceneDecorator: Towards Scene-Oriented Story Generation with Scene Planning and Scene Consistency

Add code
Oct 27, 2025
Viaarxiv icon

HERO: Hierarchical Extrapolation and Refresh for Efficient World Models

Add code
Aug 25, 2025
Figure 1 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Figure 2 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Figure 3 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Figure 4 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Viaarxiv icon

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Add code
Aug 10, 2025
Figure 1 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 2 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 3 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Figure 4 for Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Viaarxiv icon

Training-free Stylized Text-to-Image Generation with Fast Inference

Add code
May 25, 2025
Viaarxiv icon

3D Surface Reconstruction with Enhanced High-Frequency Details

Add code
May 06, 2025
Viaarxiv icon

Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training

Add code
Dec 11, 2024
Figure 1 for Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training
Figure 2 for Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training
Figure 3 for Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training
Figure 4 for Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training
Viaarxiv icon

REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation

Add code
Oct 14, 2024
Figure 1 for REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Figure 2 for REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Figure 3 for REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Figure 4 for REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation
Viaarxiv icon

A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse

Add code
Aug 20, 2024
Figure 1 for A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse
Figure 2 for A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse
Figure 3 for A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse
Figure 4 for A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse
Viaarxiv icon