Picture for Zijian He

Zijian He

HMVLM: Multistage Reasoning-Enhanced Vision-Language Model for Long-Tailed Driving Scenarios

Add code
Jun 06, 2025
Viaarxiv icon

Mod-Adapter: Tuning-Free and Versatile Multi-concept Personalization via Modulation Adapter

Add code
May 24, 2025
Viaarxiv icon

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction

Add code
Mar 15, 2025
Viaarxiv icon

Cognify: Supercharging Gen-AI Workflows With Hierarchical Autotuning

Add code
Feb 12, 2025
Viaarxiv icon

Comprehensive Performance Evaluation of YOLOv11, YOLOv10, YOLOv9, YOLOv8 and YOLOv5 on Object Detection of Power Equipment

Add code
Nov 28, 2024
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Pixel-Space Post-Training of Latent Diffusion Models

Add code
Sep 26, 2024
Figure 1 for Pixel-Space Post-Training of Latent Diffusion Models
Figure 2 for Pixel-Space Post-Training of Latent Diffusion Models
Figure 3 for Pixel-Space Post-Training of Latent Diffusion Models
Figure 4 for Pixel-Space Post-Training of Latent Diffusion Models
Viaarxiv icon

Time-Varying Foot-Placement Control for Underactuated Humanoid Walking on Swaying Rigid Surfaces

Add code
Sep 12, 2024
Viaarxiv icon

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Add code
Jul 15, 2024
Viaarxiv icon