Image


Driving with DINO: Vision Foundation Features as a Unified Bridge for Sim-to-Real Generation in Autonomous Driving

Add code
Feb 09, 2026
Viaarxiv icon

Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers

Add code
Feb 09, 2026
Viaarxiv icon

Moving Beyond Functional Connectivity: Time-Series Modeling for fMRI-Based Brain Disorder Classification

Add code
Feb 09, 2026
Viaarxiv icon

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models

Add code
Feb 09, 2026
Viaarxiv icon

CLUE: Crossmodal disambiguation via Language-vision Understanding with attEntion

Add code
Feb 09, 2026
Viaarxiv icon

Understanding and Optimizing Attention-Based Sparse Matching for Diverse Local Features

Add code
Feb 09, 2026
Viaarxiv icon

FusionEdit: Semantic Fusion and Attention Modulation for Training-Free Image Editing

Add code
Feb 09, 2026
Viaarxiv icon

A Unified Framework for Multimodal Image Reconstruction and Synthesis using Denoising Diffusion Models

Add code
Feb 09, 2026
Viaarxiv icon

Grow with the Flow: 4D Reconstruction of Growing Plants with Gaussian Flow Fields

Add code
Feb 09, 2026
Viaarxiv icon

Autoregressive Image Generation with Masked Bit Modeling

Add code
Feb 09, 2026
Viaarxiv icon