Cunjian Chen

HERO: Hierarchical Extrapolation and Refresh for Efficient World Models

Aug 25, 2025
Via arXiv

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Aug 10, 2025
Via arXiv

Training-free Stylized Text-to-Image Generation with Fast Inference

May 25, 2025
Via arXiv

3D Surface Reconstruction with Enhanced High-Frequency Details

May 06, 2025
Via arXiv

Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training

Dec 11, 2024
Via arXiv

REHRSeg: Unleashing the Power of Self-Supervised Super-Resolution for Resource-Efficient 3D MRI Segmentation

Oct 14, 2024
Via arXiv

A Grey-box Attack against Latent Diffusion Model-based Image Editing by Posterior Collapse

Aug 20, 2024
Via arXiv

A Multi-task Adversarial Attack Against Face Authentication

Aug 15, 2024
Via arXiv

DiffX: Guide Your Layout to Cross-Modal Generative Modeling

Jul 28, 2024
Via arXiv

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

Jul 23, 2024
Via arXiv