photo


BetterScene: 3D Scene Synthesis with Representation-Aligned Generative Model

Add code
Feb 26, 2026
Viaarxiv icon

PhotoAgent: Agentic Photo Editing with Exploratory Visual Aesthetic Planning

Add code
Feb 26, 2026
Viaarxiv icon

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Add code
Feb 26, 2026
Viaarxiv icon

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Add code
Feb 26, 2026
Viaarxiv icon

How to Take a Memorable Picture? Empowering Users with Actionable Feedback

Add code
Feb 25, 2026
Viaarxiv icon

Indaleko: The Unified Personal Index

Add code
Feb 24, 2026
Viaarxiv icon

L3DR: 3D-aware LiDAR Diffusion and Rectification

Add code
Feb 22, 2026
Viaarxiv icon

NRGS-SLAM: Monocular Non-Rigid SLAM for Endoscopy via Deformation-Aware 3D Gaussian Splatting

Add code
Feb 19, 2026
Viaarxiv icon

Markerless 6D Pose Estimation and Position-Based Visual Servoing for Endoscopic Continuum Manipulators

Add code
Feb 18, 2026
Viaarxiv icon

Visual Persuasion: What Influences Decisions of Vision-Language Models?

Add code
Feb 17, 2026
Viaarxiv icon