Picture for Hui Zhang

Hui Zhang

Centre for Medical Image Computing, Department of Computer Science, University College London, UK

FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation

Add code
Jun 05, 2025
Viaarxiv icon

CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

Add code
May 25, 2025
Viaarxiv icon

ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation

Add code
May 24, 2025
Viaarxiv icon

ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting

Add code
Apr 10, 2025
Viaarxiv icon

RobustDexGrasp: Robust Dexterous Grasping of General Objects from Single-view Perception

Add code
Apr 07, 2025
Viaarxiv icon

HQViT: Hybrid Quantum Vision Transformer for Image Classification

Add code
Apr 03, 2025
Viaarxiv icon

Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images

Add code
Mar 21, 2025
Viaarxiv icon

MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

Add code
Mar 20, 2025
Viaarxiv icon

BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

Add code
Mar 20, 2025
Viaarxiv icon

R$^2$: A LLM Based Novel-to-Screenplay Generation Framework with Causal Plot Graphs

Add code
Mar 19, 2025
Viaarxiv icon