Image To Image Translation


Image-to-image translation is the task of mapping an image from a source domain to a corresponding image in a target domain, typically using deep learning models.

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Mar 10, 2026

When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound

Mar 22, 2026

Fanar 2.0: Arabic Generative AI Stack

Mar 17, 2026

BLOCK: An Open-Source Bi-Stage MLLM Character-to-Skin Pipeline for Minecraft

Mar 04, 2026

LoGSAM: Parameter-Efficient Cross-Modal Grounding for MRI Segmentation

Mar 18, 2026

Taming Vision Priors for Data Efficient mmWave Channel Modeling

Mar 11, 2026

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Mar 19, 2026

Investigating a Policy-Based Formulation for Endoscopic Camera Pose Recovery

Mar 20, 2026

RAFM: Retrieval-Augmented Flow Matching for Unpaired CBCT-to-CT Translation

Feb 28, 2026

RealWonder: Real-Time Physical Action-Conditioned Video Generation

Mar 05, 2026