Image


Automatic regularization parameter choice for tomography using a double model approach

Add code
Feb 09, 2026
Viaarxiv icon

What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning

Add code
Feb 09, 2026
Viaarxiv icon

Aerial Manipulation with Contact-Aware Onboard Perception and Hybrid Control

Add code
Feb 09, 2026
Viaarxiv icon

Efficient-SAM2: Accelerating SAM2 with Object-Aware Visual Encoding and Memory Retrieval

Add code
Feb 09, 2026
Viaarxiv icon

Chain-of-Caption: Training-free improvement of multimodal large language model on referring expression comprehension

Add code
Feb 09, 2026
Viaarxiv icon

Geospatial-Reasoning-Driven Vocabulary-Agnostic Remote Sensing Semantic Segmentation

Add code
Feb 09, 2026
Viaarxiv icon

Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers

Add code
Feb 09, 2026
Viaarxiv icon

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models

Add code
Feb 09, 2026
Viaarxiv icon

CLUE: Crossmodal disambiguation via Language-vision Understanding with attEntion

Add code
Feb 09, 2026
Viaarxiv icon

FusionEdit: Semantic Fusion and Attention Modulation for Training-Free Image Editing

Add code
Feb 09, 2026
Viaarxiv icon