Picture for Boyuan Jiang

Boyuan Jiang

Kling-Omni Technical Report

Add code
Dec 18, 2025
Viaarxiv icon

KlingAvatar 2.0 Technical Report

Add code
Dec 15, 2025
Viaarxiv icon

Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance

Add code
Oct 14, 2025
Viaarxiv icon

Score Augmentation for Diffusion Models

Add code
Aug 11, 2025
Viaarxiv icon

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

Add code
May 17, 2025
Figure 1 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 2 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 3 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Figure 4 for VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Viaarxiv icon

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors

Add code
Feb 20, 2025
Figure 1 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 2 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 3 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 4 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Viaarxiv icon

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Add code
Dec 04, 2024
Figure 1 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 2 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 3 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 4 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Viaarxiv icon

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Add code
Nov 26, 2024
Figure 1 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 2 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 3 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 4 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Viaarxiv icon

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Add code
Nov 22, 2024
Figure 1 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 2 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 3 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 4 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Viaarxiv icon

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Add code
Nov 22, 2024
Figure 1 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Figure 2 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Figure 3 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Figure 4 for VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing
Viaarxiv icon