Bo Peng

Analyzing Fine-Grained Alignment and Enhancing Vision Understanding in Multimodal Language Models

May 22, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

May 22, 2025

Planning with Diffusion Models for Target-Oriented Dialogue Systems

Apr 23, 2025

Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Apr 19, 2025

Rethinking Target Label Conditioning in Adversarial Attacks: A 2D Tensor-Guided Generative Approach

Apr 19, 2025

Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels

Apr 16, 2025

GraphTEN: Graph Enhanced Texture Encoding Network

Mar 18, 2025

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Mar 18, 2025

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Mar 12, 2025

PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Mar 09, 2025