Picture for ZiDong Wang

ZiDong Wang

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

Add code
May 07, 2026
Viaarxiv icon

FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model

Add code
Oct 17, 2024
Figure 1 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 2 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 3 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 4 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Viaarxiv icon

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Add code
Jul 11, 2024
Figure 1 for PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Figure 2 for PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Figure 3 for PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Figure 4 for PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Viaarxiv icon