Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

MPM: Mutual Pair Merging for Efficient Vision Transformers

Add code
Apr 07, 2026
Viaarxiv icon

InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset

Add code
Apr 04, 2026
Viaarxiv icon

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models

Add code
Apr 02, 2026
Viaarxiv icon

Sim2Real-AD: A Modular Sim-to-Real Framework for Deploying VLM-Guided Reinforcement Learning in Real-World Autonomous Driving

Add code
Apr 03, 2026
Viaarxiv icon

Structured Observation Language for Efficient and Generalizable Vision-Language Navigation

Add code
Mar 29, 2026
Viaarxiv icon

Structural Evaluation Metrics for SVG Generation via Leave-One-Out Analysis

Add code
Apr 09, 2026
Viaarxiv icon

MEDIC-AD: Towards Medical Vision-Language Model's Clinical Intelligence

Add code
Mar 28, 2026
Viaarxiv icon

HistDiT: A Structure-Aware Latent Conditional Diffusion Model for High-Fidelity Virtual Staining in Histopathology

Add code
Apr 09, 2026
Viaarxiv icon

Three Modalities, Two Design Probes, One Prototype, and No Vision: Experience-Based Co-Design of a Multi-modal 3D Data Visualization Tool

Add code
Apr 10, 2026
Viaarxiv icon

M2H-MX: Multi-Task Dense Visual Perception for Real-Time Monocular Spatial Understanding

Add code
Mar 31, 2026
Viaarxiv icon