Image To Image Translation


Image-to-image translation is the process of converting an image from one domain to another using deep learning techniques.

Designing Practical Models for Isolated Word Visual Speech Recognition

Add code
Aug 25, 2025
Viaarxiv icon

Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR

Add code
Aug 29, 2025
Figure 1 for Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Figure 2 for Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Figure 3 for Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Figure 4 for Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR
Viaarxiv icon

ConlangCrafter: Constructing Languages with a Multi-Hop LLM Pipeline

Add code
Aug 08, 2025
Viaarxiv icon

DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal

Add code
Jul 02, 2025
Figure 1 for DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal
Figure 2 for DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal
Figure 3 for DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal
Figure 4 for DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal
Viaarxiv icon

CONCAP: Seeing Beyond English with Concepts Retrieval-Augmented Captioning

Add code
Jul 27, 2025
Viaarxiv icon

Advanced Multi-Architecture Deep Learning Framework for BIRADS-Based Mammographic Image Retrieval: Comprehensive Performance Analysis with Super-Ensemble Optimization

Add code
Aug 06, 2025
Viaarxiv icon

MetaCLIP 2: A Worldwide Scaling Recipe

Add code
Jul 29, 2025
Figure 1 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 2 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 3 for MetaCLIP 2: A Worldwide Scaling Recipe
Figure 4 for MetaCLIP 2: A Worldwide Scaling Recipe
Viaarxiv icon

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis

Add code
Jul 09, 2025
Figure 1 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 2 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 3 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Figure 4 for Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis
Viaarxiv icon

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow

Add code
Jul 08, 2025
Figure 1 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Figure 2 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Figure 3 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Figure 4 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Viaarxiv icon

Neural Concept Verifier: Scaling Prover-Verifier Games via Concept Encodings

Add code
Jul 10, 2025
Viaarxiv icon