Kwan-Yee K. Wong

LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization

Jan 01, 2026

Unison: A Fully Automatic, Task-Universal, and Low-Cost Framework for Unified Understanding and Generation

Dec 08, 2025

Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers

Jun 09, 2025

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

Apr 27, 2025

VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models

Jan 21, 2025

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Oct 25, 2024

BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Oct 18, 2024

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Oct 09, 2024

ArtiFade: Learning to Generate High-quality Subject from Blemished Images

Sep 05, 2024

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Jul 09, 2024