Picture for Yuhong Zhang

Yuhong Zhang

AnimeColor: Reference-based Animation Colorization with Diffusion Transformers

Add code
Jul 27, 2025
Viaarxiv icon

Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker

Add code
Jul 16, 2025
Viaarxiv icon

NOVA3D: Normal Aligned Video Diffusion Model for Single Image to 3D Generation

Add code
Jun 09, 2025
Viaarxiv icon

HumanMM: Global Human Motion Recovery from Multi-shot Videos

Add code
Mar 10, 2025
Viaarxiv icon

Frequency-Based Alignment of EEG and Audio Signals Using Contrastive Learning and SincNet for Auditory Attention Detection

Add code
Mar 06, 2025
Viaarxiv icon

Consistent Video Colorization via Palette Guidance

Add code
Jan 31, 2025
Figure 1 for Consistent Video Colorization via Palette Guidance
Figure 2 for Consistent Video Colorization via Palette Guidance
Figure 3 for Consistent Video Colorization via Palette Guidance
Figure 4 for Consistent Video Colorization via Palette Guidance
Viaarxiv icon

Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset

Add code
Jan 09, 2025
Figure 1 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Figure 2 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Figure 3 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Figure 4 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Viaarxiv icon

Towards Effective Graph Rationalization via Boosting Environment Diversity

Add code
Dec 17, 2024
Figure 1 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Figure 2 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Figure 3 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Figure 4 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Viaarxiv icon

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Add code
Nov 21, 2024
Figure 1 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 2 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 3 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 4 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Viaarxiv icon

MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration

Add code
Jul 04, 2024
Figure 1 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Figure 2 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Figure 3 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Figure 4 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Viaarxiv icon