Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

MPath: Multimodal Pathology Report Generation from Whole Slide Images

Add code
Dec 10, 2025
Viaarxiv icon

Zero-Shot Textual Explanations via Translating Decision-Critical Features

Add code
Dec 08, 2025
Viaarxiv icon

MR Fingerprinting for Imaging Brain Hemodynamics and Oxygenation

Add code
Dec 15, 2025
Viaarxiv icon

Bitbox: Behavioral Imaging Toolbox for Computational Analysis of Behavior from Videos

Add code
Dec 19, 2025
Figure 1 for Bitbox: Behavioral Imaging Toolbox for Computational Analysis of Behavior from Videos
Figure 2 for Bitbox: Behavioral Imaging Toolbox for Computational Analysis of Behavior from Videos
Figure 3 for Bitbox: Behavioral Imaging Toolbox for Computational Analysis of Behavior from Videos
Figure 4 for Bitbox: Behavioral Imaging Toolbox for Computational Analysis of Behavior from Videos
Viaarxiv icon

Multi-scale Attention-Guided Intrinsic Decomposition and Rendering Pass Prediction for Facial Images

Add code
Dec 18, 2025
Figure 1 for Multi-scale Attention-Guided Intrinsic Decomposition and Rendering Pass Prediction for Facial Images
Figure 2 for Multi-scale Attention-Guided Intrinsic Decomposition and Rendering Pass Prediction for Facial Images
Figure 3 for Multi-scale Attention-Guided Intrinsic Decomposition and Rendering Pass Prediction for Facial Images
Figure 4 for Multi-scale Attention-Guided Intrinsic Decomposition and Rendering Pass Prediction for Facial Images
Viaarxiv icon

Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models

Add code
Dec 16, 2025
Figure 1 for Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models
Figure 2 for Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models
Figure 3 for Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models
Figure 4 for Parameter Efficient Multimodal Instruction Tuning for Romanian Vision Language Models
Viaarxiv icon

AMD-HookNet++: Evolution of AMD-HookNet with Hybrid CNN-Transformer Feature Enhancement for Glacier Calving Front Segmentation

Add code
Dec 16, 2025
Viaarxiv icon

Towards Scalable Pre-training of Visual Tokenizers for Generation

Add code
Dec 15, 2025
Viaarxiv icon

VFMF: World Modeling by Forecasting Vision Foundation Model Features

Add code
Dec 12, 2025
Figure 1 for VFMF: World Modeling by Forecasting Vision Foundation Model Features
Figure 2 for VFMF: World Modeling by Forecasting Vision Foundation Model Features
Figure 3 for VFMF: World Modeling by Forecasting Vision Foundation Model Features
Figure 4 for VFMF: World Modeling by Forecasting Vision Foundation Model Features
Viaarxiv icon

SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation

Add code
Dec 11, 2025
Figure 1 for SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation
Figure 2 for SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation
Figure 3 for SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation
Figure 4 for SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation
Viaarxiv icon