Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR

Aug 29, 2025

Dual-branch Prompting for Multimodal Machine Translation

Jul 23, 2025

ConlangCrafter: Constructing Languages with a Multi-Hop LLM Pipeline

Aug 08, 2025

DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal

Jul 02, 2025

CONCAP: Seeing Beyond English with Concepts Retrieval-Augmented Captioning

Jul 27, 2025

Advanced Multi-Architecture Deep Learning Framework for BIRADS-Based Mammographic Image Retrieval: Comprehensive Performance Analysis with Super-Ensemble Optimization

Aug 06, 2025

MetaCLIP 2: A Worldwide Scaling Recipe

Jul 29, 2025

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis

Jul 09, 2025

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow

Jul 08, 2025

Neural Concept Verifier: Scaling Prover-Verifier Games via Concept Encodings

Jul 10, 2025