Picture for Yuhang Ma

Yuhang Ma

CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities

Add code
Aug 20, 2025
Viaarxiv icon

NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer

Add code
Aug 14, 2025
Viaarxiv icon

FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer

Add code
Aug 07, 2025
Viaarxiv icon

A Robust Real-Time Lane Detection Method with Fog-Enhanced Feature Fusion for Foggy Conditions

Add code
Apr 08, 2025
Viaarxiv icon

YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection via Prior-Guided Enhancement and Multi-Branch Feature Interaction

Add code
Mar 18, 2025
Viaarxiv icon

PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models

Add code
Mar 13, 2025
Viaarxiv icon

NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers

Add code
Mar 12, 2025
Viaarxiv icon

WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation

Add code
Mar 11, 2025
Viaarxiv icon

U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers

Add code
Mar 11, 2025
Viaarxiv icon

HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation

Add code
Oct 18, 2024
Viaarxiv icon