Picture for Ke Cao

Ke Cao

FedCVU: Federated Learning for Cross-View Video Understanding

Add code
Mar 23, 2026
Viaarxiv icon

Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark

Add code
Feb 28, 2026
Viaarxiv icon

MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation

Add code
Dec 26, 2025
Viaarxiv icon

Self-supervised Multiplex Consensus Mamba for General Image Fusion

Add code
Dec 24, 2025
Viaarxiv icon

Active Intelligence in Video Avatars via Closed-loop World Modeling

Add code
Dec 23, 2025
Viaarxiv icon

Lay2Story: Extending Diffusion Transformers for Layout-Togglable Story Generation

Add code
Aug 12, 2025
Viaarxiv icon

Distilling Textual Priors from LLM to Efficient Image Fusion

Add code
Apr 09, 2025
Figure 1 for Distilling Textual Priors from LLM to Efficient Image Fusion
Figure 2 for Distilling Textual Priors from LLM to Efficient Image Fusion
Figure 3 for Distilling Textual Priors from LLM to Efficient Image Fusion
Figure 4 for Distilling Textual Priors from LLM to Efficient Image Fusion
Viaarxiv icon

WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation

Add code
Mar 11, 2025
Viaarxiv icon

U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers

Add code
Mar 11, 2025
Viaarxiv icon

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Add code
Feb 21, 2025
Figure 1 for RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers
Figure 2 for RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers
Figure 3 for RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers
Figure 4 for RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers
Viaarxiv icon