Picture for Xu Peng

Xu Peng

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing

Add code
Jan 06, 2026
Viaarxiv icon

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Add code
Dec 15, 2025
Viaarxiv icon

OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research

Add code
Oct 30, 2025
Viaarxiv icon

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors

Add code
Feb 20, 2025
Figure 1 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 2 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 3 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 4 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Viaarxiv icon

Oracle Bone Inscriptions Multi-modal Dataset

Add code
Jul 04, 2024
Figure 1 for Oracle Bone Inscriptions Multi-modal Dataset
Figure 2 for Oracle Bone Inscriptions Multi-modal Dataset
Viaarxiv icon

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Add code
Mar 10, 2024
Viaarxiv icon

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

Add code
Dec 11, 2023
Figure 1 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Figure 2 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Figure 3 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Figure 4 for PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization
Viaarxiv icon