Zhenye Gan

The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection

Dec 23, 2025

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Dec 15, 2025

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10×

Dec 15, 2025

RoleRMBench & RoleRM: Towards Reward Modeling for Profile-Based Role Play in Dialogue Systems

Dec 11, 2025

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Jul 02, 2025

Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Apr 19, 2025

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Mar 12, 2025

PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Mar 09, 2025

SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation

Dec 30, 2024

MobileMamba: Lightweight Multi-Receptive Visual Mamba Network

Nov 24, 2024