Picture for Zhonghua Zhai

Zhonghua Zhai

MMCORE: MultiModal COnnection with Representation Aligned Latent Embeddings

Add code
Apr 21, 2026
Viaarxiv icon

Seedance 2.0: Advancing Video Generation for World Complexity

Add code
Apr 15, 2026
Viaarxiv icon

Composable Visual Tokenizers with Generator-Free Diagnostics of Learnability

Add code
Feb 03, 2026
Viaarxiv icon

Revisiting Multi-Task Visual Representation Learning

Add code
Jan 20, 2026
Viaarxiv icon

Universal Video Temporal Grounding with Generative Multi-modal Large Language Models

Add code
Jun 23, 2025
Viaarxiv icon

SeedEdit 3.0: Fast and High-Quality Generative Image Editing

Add code
Jun 06, 2025
Viaarxiv icon

Seedream 3.0 Technical Report

Add code
Apr 16, 2025
Figure 1 for Seedream 3.0 Technical Report
Figure 2 for Seedream 3.0 Technical Report
Figure 3 for Seedream 3.0 Technical Report
Figure 4 for Seedream 3.0 Technical Report
Viaarxiv icon

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Add code
Mar 10, 2025
Viaarxiv icon

Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

Add code
Apr 26, 2024
Figure 1 for Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
Figure 2 for Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
Figure 3 for Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
Figure 4 for Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
Viaarxiv icon

Cell Variational Information Bottleneck Network

Add code
Mar 29, 2024
Figure 1 for Cell Variational Information Bottleneck Network
Figure 2 for Cell Variational Information Bottleneck Network
Figure 3 for Cell Variational Information Bottleneck Network
Figure 4 for Cell Variational Information Bottleneck Network
Viaarxiv icon