TASTE: A Designer-Annotated Multi-Dimensional Preference Dataset for AI-Generated Graphic Design

May 20, 2026
Via arXiv

Multi-axis Analysis of Image Manipulation Localization

May 19, 2026
Via arXiv

Cross-View Splatter: Feed-Forward View Synthesis with Georeferenced Images

May 19, 2026
Via arXiv

Personalized Face Privacy Protection From a Single Image

May 18, 2026
Via arXiv

MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

May 18, 2026
Via arXiv

Unlocking UML Class Diagram Understanding in Vision Language Models

May 12, 2026
Via arXiv

3D Primitives are a Spatial Language for VLMs

May 12, 2026
Via arXiv

MFVLR: Multi-domain Fine-grained Vision-Language Reconstruction for Generalizable Diffusion Face Forgery Detection and Localization

May 11, 2026
Via arXiv

DPM++: Dynamic Masked Metric Learning for Occluded Person Re-identification

May 07, 2026
Via arXiv

Leveraging Image Generators to Address Training Data Scarcity: The Gen4Regen Dataset for Forest Regeneration Mapping

May 07, 2026
Via arXiv