Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model

Add code
Sep 17, 2025
Figure 1 for GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model
Figure 2 for GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model
Figure 3 for GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model
Figure 4 for GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model
Viaarxiv icon

Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement

Add code
Aug 27, 2025
Figure 1 for Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement
Figure 2 for Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement
Figure 3 for Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement
Figure 4 for Bangla-Bayanno: A 52K-Pair Bengali Visual Question Answering Dataset with LLM-Assisted Translation Refinement
Viaarxiv icon

Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation

Add code
Jul 10, 2025
Viaarxiv icon

CoSwin: Convolution Enhanced Hierarchical Shifted Window Attention For Small-Scale Vision

Add code
Sep 10, 2025
Viaarxiv icon

No Masks Needed: Explainable AI for Deriving Segmentation from Classification

Add code
Aug 06, 2025
Viaarxiv icon

Integrating Anatomical Priors into a Causal Diffusion Model

Add code
Sep 10, 2025
Figure 1 for Integrating Anatomical Priors into a Causal Diffusion Model
Figure 2 for Integrating Anatomical Priors into a Causal Diffusion Model
Figure 3 for Integrating Anatomical Priors into a Causal Diffusion Model
Figure 4 for Integrating Anatomical Priors into a Causal Diffusion Model
Viaarxiv icon

Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models

Add code
Aug 14, 2025
Viaarxiv icon

SIDA: Synthetic Image Driven Zero-shot Domain Adaptation

Add code
Jul 24, 2025
Viaarxiv icon

Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation

Add code
Aug 08, 2025
Figure 1 for Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation
Figure 2 for Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation
Figure 3 for Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation
Figure 4 for Roll Your Eyes: Gaze Redirection via Explicit 3D Eyeball Rotation
Viaarxiv icon

CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities

Add code
Aug 20, 2025
Viaarxiv icon