Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

Time-Correlated Video Bridge Matching

Add code
Oct 14, 2025
Viaarxiv icon

DistillMatch: Leveraging Knowledge Distillation from Vision Foundation Model for Multimodal Image Matching

Add code
Sep 19, 2025
Viaarxiv icon

REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework

Add code
Aug 23, 2025
Figure 1 for REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework
Figure 2 for REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework
Figure 3 for REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework
Figure 4 for REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework
Viaarxiv icon

Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization

Add code
Oct 08, 2025
Figure 1 for Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
Figure 2 for Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
Figure 3 for Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
Figure 4 for Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
Viaarxiv icon

Multilingual Vision-Language Models, A Survey

Add code
Sep 26, 2025
Viaarxiv icon

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Add code
Oct 01, 2025
Figure 1 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 2 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 3 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Figure 4 for Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Viaarxiv icon

COCO-Urdu: A Large-Scale Urdu Image-Caption Dataset with Multimodal Quality Estimation

Add code
Sep 10, 2025
Viaarxiv icon

GP3: A 3D Geometry-Aware Policy with Multi-View Images for Robotic Manipulation

Add code
Sep 19, 2025
Viaarxiv icon

XOCT: Enhancing OCT to OCTA Translation via Cross-Dimensional Supervised Multi-Scale Feature Learning

Add code
Sep 09, 2025
Viaarxiv icon

Image-Guided Surgery: Technology, Quality, Innovation, and Opportunities for Medical Physics

Add code
Sep 03, 2025
Viaarxiv icon