Image-to-Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Jan 08, 2026

Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment

Dec 22, 2025

Using Large Language Models To Translate Machine Results To Human Results

Dec 30, 2025

Aligning Findings with Diagnosis: A Self-Consistent Reinforcement Learning Framework for Trustworthy Radiology Reporting

Jan 06, 2026

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Dec 31, 2025

The Color-Clinical Decoupling: Why Perceptual Calibration Fails Clinical Biomarkers in Smartphone Dermatology

Dec 26, 2025

Seeing Justice Clearly: Handwritten Legal Document Translation with OCR and Vision-Language Models

Dec 19, 2025

LVLM-Aided Alignment of Task-Specific Vision Models

Dec 26, 2025

The OCR-PT-CT Project: Semi-Automatic Recognition of Ancient Egyptian Hieroglyphs Based on Metric Learning

Dec 30, 2025

Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real

Dec 13, 2025