Picture for Gang Yu

Gang Yu

Department of Biomedical Engineering, School of Basic Medical Sciences, Central South University, Changsha, China

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer

Add code
Aug 12, 2025
Viaarxiv icon

SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning

Add code
Aug 08, 2025
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Viaarxiv icon

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Add code
Jun 09, 2025
Viaarxiv icon

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

Add code
May 30, 2025
Viaarxiv icon

DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds

Add code
May 30, 2025
Viaarxiv icon

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Add code
May 22, 2025
Viaarxiv icon

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Add code
May 12, 2025
Viaarxiv icon

Step1X-Edit: A Practical Framework for General Image Editing

Add code
Apr 24, 2025
Viaarxiv icon

StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

Add code
Apr 21, 2025
Viaarxiv icon