Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

Regional Attention-Enhanced Swin Transformer for Clinically Relevant Medical Image Captioning

Add code
Nov 13, 2025
Viaarxiv icon

DT-NVS: Diffusion Transformers for Novel View Synthesis

Add code
Nov 11, 2025
Viaarxiv icon

QSMnet-INR: Single-Orientation Quantitative Susceptibility Mapping via Implicit Neural Representation in k-Space

Add code
Dec 10, 2025
Viaarxiv icon

CrochetBench: Can Vision-Language Models Move from Describing to Doing in Crochet Domain?

Add code
Nov 12, 2025
Figure 1 for CrochetBench: Can Vision-Language Models Move from Describing to Doing in Crochet Domain?
Figure 2 for CrochetBench: Can Vision-Language Models Move from Describing to Doing in Crochet Domain?
Figure 3 for CrochetBench: Can Vision-Language Models Move from Describing to Doing in Crochet Domain?
Figure 4 for CrochetBench: Can Vision-Language Models Move from Describing to Doing in Crochet Domain?
Viaarxiv icon

TraceTrans: Translation and Spatial Tracing for Surgical Prediction

Add code
Oct 25, 2025
Viaarxiv icon

Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality

Add code
Nov 06, 2025
Figure 1 for Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality
Figure 2 for Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality
Figure 3 for Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality
Figure 4 for Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality
Viaarxiv icon

FlowRoI A Fast Optical Flow Driven Region of Interest Extraction Framework for High-Throughput Image Compression in Immune Cell Migration Analysis

Add code
Nov 18, 2025
Viaarxiv icon

Towards Rotation-only Imaging Geometry: Rotation Estimation

Add code
Nov 16, 2025
Viaarxiv icon

Towards Metric-Aware Multi-Person Mesh Recovery by Jointly Optimizing Human Crowd in Camera Space

Add code
Nov 17, 2025
Viaarxiv icon

Fractional neural attention for efficient multiscale sequence processing

Add code
Nov 13, 2025
Viaarxiv icon