photo


TASTE: A Designer-Annotated Multi-Dimensional Preference Dataset for AI-Generated Graphic Design

Add code
May 20, 2026
Viaarxiv icon

Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images

Add code
May 19, 2026
Viaarxiv icon

Multi-axis Analysis of Image Manipulation Localization

Add code
May 19, 2026
Viaarxiv icon

MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

Add code
May 18, 2026
Viaarxiv icon

Personalized Face Privacy Protection From a Single Image

Add code
May 18, 2026
Viaarxiv icon

Unlocking UML Class Diagram Understanding in Vision Language Models

Add code
May 12, 2026
Viaarxiv icon

3D Primitives are a Spatial Language for VLMs

Add code
May 12, 2026
Viaarxiv icon

MFVLR: Multi-domain Fine-grained Vision-Language Reconstruction for Generalizable Diffusion Face Forgery Detection and Localization

Add code
May 11, 2026
Viaarxiv icon

DPM++: Dynamic Masked Metric Learning for Occluded Person Re-identification

Add code
May 07, 2026
Viaarxiv icon

Leveraging Image Generators to Address Training Data Scarcity: The Gen4Regen Dataset for Forest Regeneration Mapping

Add code
May 07, 2026
Viaarxiv icon