Picture for Jinlong Peng

Jinlong Peng

CareCom: Generative Image Composition with Calibrated Reference Features

Add code
Nov 14, 2025
Viaarxiv icon

Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection

Add code
Oct 30, 2025
Figure 1 for Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
Figure 2 for Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
Figure 3 for Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
Figure 4 for Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
Viaarxiv icon

Swin DiT: Diffusion Transformer using Pseudo Shifted Windows

Add code
May 19, 2025
Viaarxiv icon

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Add code
May 06, 2025
Viaarxiv icon

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Add code
Mar 12, 2025
Figure 1 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Figure 2 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Figure 3 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Figure 4 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Viaarxiv icon

PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Add code
Mar 09, 2025
Viaarxiv icon

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Add code
Dec 04, 2024
Figure 1 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 2 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 3 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 4 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Viaarxiv icon

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Add code
Nov 22, 2024
Figure 1 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 2 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 3 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 4 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Viaarxiv icon

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Add code
Sep 16, 2024
Viaarxiv icon

VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis

Add code
Sep 12, 2024
Figure 1 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Figure 2 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Figure 3 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Figure 4 for VI3DRM:Towards meticulous 3D Reconstruction from Sparse Views via Photo-Realistic Novel View Synthesis
Viaarxiv icon