Image


MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance

Add code
Feb 08, 2026
Viaarxiv icon

VFace: A Training-Free Approach for Diffusion-Based Video Face Swapping

Add code
Feb 08, 2026
Viaarxiv icon

DINO-Mix: Distilling Foundational Knowledge with Cross-Domain CutMix for Semi-supervised Class-imbalanced Medical Image Segmentation

Add code
Feb 08, 2026
Viaarxiv icon

MaD-Mix: Multi-Modal Data Mixtures via Latent Space Coupling for Vision-Language Model Training

Add code
Feb 08, 2026
Viaarxiv icon

A hybrid Kolmogorov-Arnold network for medical image segmentation

Add code
Feb 07, 2026
Viaarxiv icon

Vision and language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning

Add code
Feb 07, 2026
Viaarxiv icon

SciClaimEval: Cross-modal Claim Verification in Scientific Papers

Add code
Feb 07, 2026
Viaarxiv icon

Visualizing the Invisible: Enhancing Radiologist Performance in Breast Mammography via Task-Driven Chromatic Encoding

Add code
Feb 07, 2026
Viaarxiv icon

Cross-Camera Cow Identification via Disentangled Representation Learning

Add code
Feb 07, 2026
Viaarxiv icon

FlexID: Training-Free Flexible Identity Injection via Intent-Aware Modulation for Text-to-Image Generation

Add code
Feb 07, 2026
Viaarxiv icon