Picture for Qian Yu

Qian Yu

Marketing and Commercialization Center, JD.com

ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models

Add code
Jun 11, 2025
Viaarxiv icon

Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation

Add code
May 30, 2025
Viaarxiv icon

Unleashing the Power of Intermediate Domains for Mixed Domain Semi-Supervised Medical Image Segmentation

Add code
May 30, 2025
Viaarxiv icon

ViewCraft3D: High-Fidelity and View-Consistent 3D Vector Graphics Synthesis

Add code
May 26, 2025
Viaarxiv icon

TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments

Add code
May 23, 2025
Viaarxiv icon

Balancing Multi-Target Semi-Supervised Medical Image Segmentation with Collaborative Generalist and Specialists

Add code
Apr 01, 2025
Viaarxiv icon

VRsketch2Gaussian: 3D VR Sketch Guided 3D Object Generation with Gaussian Splatting

Add code
Mar 16, 2025
Viaarxiv icon

From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach

Add code
Dec 17, 2024
Figure 1 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Figure 2 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Figure 3 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Figure 4 for From 2D CAD Drawings to 3D Parametric Models: A Vision-Language Approach
Viaarxiv icon

Empowering LLMs to Understand and Generate Complex Vector Graphics

Add code
Dec 15, 2024
Viaarxiv icon

SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion

Add code
Dec 11, 2024
Viaarxiv icon