Picture for Yangguang Li

Yangguang Li

Skywork UniPic 3.0: Unified Multi-Image Composition via Sequence Modeling

Add code
Jan 22, 2026
Viaarxiv icon

Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface

Add code
Nov 12, 2025
Figure 1 for Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Figure 2 for Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Figure 3 for Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Figure 4 for Faithful Contouring: Near-Lossless 3D Voxel Representation Free from Iso-surface
Viaarxiv icon

Transition Models: Rethinking the Generative Learning Objective

Add code
Sep 04, 2025
Figure 1 for Transition Models: Rethinking the Generative Learning Objective
Figure 2 for Transition Models: Rethinking the Generative Learning Objective
Figure 3 for Transition Models: Rethinking the Generative Learning Objective
Figure 4 for Transition Models: Rethinking the Generative Learning Objective
Viaarxiv icon

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Add code
Jun 26, 2025
Viaarxiv icon

Flow-GRPO: Training Flow Matching Models via Online RL

Add code
May 08, 2025
Figure 1 for Flow-GRPO: Training Flow Matching Models via Online RL
Figure 2 for Flow-GRPO: Training Flow Matching Models via Online RL
Figure 3 for Flow-GRPO: Training Flow Matching Models via Online RL
Figure 4 for Flow-GRPO: Training Flow Matching Models via Online RL
Viaarxiv icon

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models

Add code
Apr 25, 2025
Figure 1 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Figure 2 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Figure 3 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Figure 4 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Viaarxiv icon

HoloPart: Generative 3D Part Amodal Segmentation

Add code
Apr 10, 2025
Viaarxiv icon

MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs

Add code
Mar 29, 2025
Viaarxiv icon

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

Add code
Mar 27, 2025
Viaarxiv icon

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Add code
Feb 18, 2025
Viaarxiv icon