
Jiachun Jin

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Apr 02, 2026
Via arXiv

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders

Jan 15, 2026
Via arXiv

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads

Nov 28, 2024
Via arXiv