Leonid Sigal

On Pre-training of Multimodal Language Models Customized for Chart Understanding

Jul 19, 2024

Representing Animatable Avatar via Factorized Neural Fields

Jun 02, 2024

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Apr 17, 2024

Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection

Mar 21, 2024

Visual Concept-driven Image Generation with Text-to-Image Diffusion Model

Feb 18, 2024

Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)

Jan 23, 2024

Joint Generative Modeling of Scene Graphs and Images via Diffusion Models

Jan 02, 2024

Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models

Dec 19, 2023

M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking

Dec 13, 2023

TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models

Dec 03, 2023