Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

Multi-Domain Brain Vessel Segmentation Through Feature Disentanglement

Add code
Oct 02, 2025
Figure 1 for Multi-Domain Brain Vessel Segmentation Through Feature Disentanglement
Figure 2 for Multi-Domain Brain Vessel Segmentation Through Feature Disentanglement
Figure 3 for Multi-Domain Brain Vessel Segmentation Through Feature Disentanglement
Figure 4 for Multi-Domain Brain Vessel Segmentation Through Feature Disentanglement
Viaarxiv icon

RaceGAN: A Framework for Preserving Individuality while Converting Racial Information for Image-to-Image Translation

Add code
Sep 18, 2025
Viaarxiv icon

Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces

Add code
Sep 30, 2025
Figure 1 for Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
Figure 2 for Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
Figure 3 for Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
Figure 4 for Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces
Viaarxiv icon

Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance

Add code
Sep 26, 2025
Figure 1 for Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Figure 2 for Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Figure 3 for Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Figure 4 for Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance
Viaarxiv icon

Transport Based Mean Flows for Generative Modeling

Add code
Sep 26, 2025
Figure 1 for Transport Based Mean Flows for Generative Modeling
Figure 2 for Transport Based Mean Flows for Generative Modeling
Figure 3 for Transport Based Mean Flows for Generative Modeling
Figure 4 for Transport Based Mean Flows for Generative Modeling
Viaarxiv icon

Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss

Add code
Sep 26, 2025
Figure 1 for Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss
Figure 2 for Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss
Figure 3 for Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss
Figure 4 for Deep Learning-Based Cross-Anatomy CT Synthesis Using Adapted nnResU-Net with Anatomical Feature Prioritized Loss
Viaarxiv icon

AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring

Add code
Sep 16, 2025
Figure 1 for AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring
Figure 2 for AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring
Figure 3 for AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring
Figure 4 for AREPAS: Anomaly Detection in Fine-Grained Anatomy with Reconstruction-Based Semantic Patch-Scoring
Viaarxiv icon

PRIM: Towards Practical In-Image Multilingual Machine Translation

Add code
Sep 05, 2025
Figure 1 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Figure 2 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Figure 3 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Figure 4 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Viaarxiv icon

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Add code
Sep 19, 2025
Figure 1 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 2 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 3 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 4 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Viaarxiv icon

Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion

Add code
Oct 06, 2025
Figure 1 for Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion
Figure 2 for Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion
Figure 3 for Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion
Figure 4 for Bidirectional Mammogram View Translation with Column-Aware and Implicit 3D Conditional Diffusion
Viaarxiv icon