Picture for Gabriela Ben Melech Stan

Gabriela Ben Melech Stan

LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models

Add code
Apr 03, 2024
Figure 1 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Figure 2 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Figure 3 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Figure 4 for LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models
Viaarxiv icon

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Add code
Apr 01, 2024
Figure 1 for Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Figure 2 for Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Figure 3 for Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Figure 4 for Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Viaarxiv icon

LDM3D-VR: Latent Diffusion Model for 3D VR

Add code
Nov 06, 2023
Viaarxiv icon

LDM3D: Latent Diffusion Model for 3D

Add code
May 21, 2023
Figure 1 for LDM3D: Latent Diffusion Model for 3D
Figure 2 for LDM3D: Latent Diffusion Model for 3D
Figure 3 for LDM3D: Latent Diffusion Model for 3D
Figure 4 for LDM3D: Latent Diffusion Model for 3D
Viaarxiv icon

Improving video retrieval using multilingual knowledge transfer

Add code
Aug 28, 2022
Figure 1 for Improving video retrieval using multilingual knowledge transfer
Figure 2 for Improving video retrieval using multilingual knowledge transfer
Figure 3 for Improving video retrieval using multilingual knowledge transfer
Figure 4 for Improving video retrieval using multilingual knowledge transfer
Viaarxiv icon