photo


SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes

Add code
May 24, 2025
Viaarxiv icon

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Add code
May 24, 2025
Viaarxiv icon

PawPrint: Whose Footprints Are These? Identifying Animal Individuals by Their Footprints

Add code
May 23, 2025
Viaarxiv icon

Understanding Generative AI Capabilities in Everyday Image Editing Tasks

Add code
May 22, 2025
Viaarxiv icon

Seeing through Satellite Images at Street Views

Add code
May 22, 2025
Viaarxiv icon

UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat

Add code
May 22, 2025
Viaarxiv icon

Depth Transfer: Learning to See Like a Simulator for Real-World Drone Navigation

Add code
May 18, 2025
Viaarxiv icon

3D-Fixup: Advancing Photo Editing with 3D Priors

Add code
May 15, 2025
Viaarxiv icon

Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians

Add code
May 14, 2025
Viaarxiv icon

MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills

Add code
May 09, 2025
Viaarxiv icon