Picture for Leonid Sigal

Leonid Sigal

On Pre-training of Multimodal Language Models Customized for Chart Understanding

Add code
Jul 19, 2024
Viaarxiv icon

Representing Animatable Avatar via Factorized Neural Fields

Add code
Jun 02, 2024
Figure 1 for Representing Animatable Avatar via Factorized Neural Fields
Figure 2 for Representing Animatable Avatar via Factorized Neural Fields
Figure 3 for Representing Animatable Avatar via Factorized Neural Fields
Figure 4 for Representing Animatable Avatar via Factorized Neural Fields
Viaarxiv icon

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Add code
Apr 17, 2024
Figure 1 for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Figure 2 for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Figure 3 for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Figure 4 for Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach
Viaarxiv icon

Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection

Add code
Mar 21, 2024
Figure 1 for Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Figure 2 for Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Figure 3 for Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Figure 4 for Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Viaarxiv icon

Visual Concept-driven Image Generation with Text-to-Image Diffusion Model

Add code
Feb 18, 2024
Viaarxiv icon

Multi-modal News Understanding with Professionally Labelled Videos (ReutersViLNews)

Add code
Jan 23, 2024
Viaarxiv icon

Joint Generative Modeling of Scene Graphs and Images via Diffusion Models

Add code
Jan 02, 2024
Viaarxiv icon

Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models

Add code
Dec 19, 2023
Viaarxiv icon

M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking

Add code
Dec 13, 2023
Figure 1 for M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking
Figure 2 for M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking
Figure 3 for M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking
Figure 4 for M3T: Multi-Scale Memory Matching for Video Object Segmentation and Tracking
Viaarxiv icon

TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models

Add code
Dec 03, 2023
Figure 1 for TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models
Figure 2 for TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models
Figure 3 for TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models
Figure 4 for TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models
Viaarxiv icon