

CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion

Jan 15, 2025

Make-A-Character 2: Animatable 3D Character Generation From a Single Image

Jan 15, 2025

CureGraph: Contrastive Multi-Modal Graph Representation Learning for Urban Living Circle Health Profiling and Prediction

Jan 13, 2025

Learning Implicit Social Navigation Behavior using Deep Inverse Reinforcement Learning

Jan 12, 2025

Discovering an Image-Adaptive Coordinate System for Photography Processing

Jan 11, 2025

Underwater Image Enhancement using Generative Adversarial Networks: A Survey

Jan 10, 2025

Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants

Jan 05, 2025

DreamDrive: Generative 4D Scene Modeling from Street View Images

Jan 03, 2025

Ingredients: Blending Custom Photos with Video Diffusion Transformers

Jan 03, 2025

A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia

Jan 01, 2025