Picture for Dongzhi Jiang

Dongzhi Jiang

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Add code
Mar 09, 2026
Viaarxiv icon

Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation

Add code
Feb 02, 2026
Viaarxiv icon

Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Add code
Dec 11, 2025
Viaarxiv icon

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

Add code
Oct 30, 2025
Viaarxiv icon

Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation

Add code
Aug 13, 2025
Figure 1 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Figure 2 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Figure 3 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Figure 4 for Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Viaarxiv icon

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Add code
May 01, 2025
Viaarxiv icon

ADT: Tuning Diffusion Models with Adversarial Supervision

Add code
Apr 15, 2025
Figure 1 for ADT: Tuning Diffusion Models with Adversarial Supervision
Figure 2 for ADT: Tuning Diffusion Models with Adversarial Supervision
Figure 3 for ADT: Tuning Diffusion Models with Adversarial Supervision
Figure 4 for ADT: Tuning Diffusion Models with Adversarial Supervision
Viaarxiv icon

SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems

Add code
Mar 13, 2025
Viaarxiv icon

PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

Add code
Mar 13, 2025
Figure 1 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 2 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 3 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 4 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Viaarxiv icon