Fabian Caba Heilbron

Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets

Sep 02, 2024
Via arXiv

Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval

May 06, 2024
Via arXiv

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

Apr 05, 2024
Via arXiv

Towards Automated Movie Trailer Generation

Apr 04, 2024
Via arXiv

Scaling Up Video Summarization Pretraining with Large Language Models

Apr 04, 2024
Via arXiv

Long-range Multimodal Pretraining for Movie Understanding

Aug 18, 2023
Via arXiv

Meta-Personalizing Vision-Language Models to Find Named Instances in Video

Jun 16, 2023
Via arXiv

Localizing Moments in Long Video Via Multimodal Guidance

Feb 26, 2023
Via arXiv

PIVOT: Prompting for Video Continual Learning

Dec 09, 2022
Via arXiv

VideoMap: Video Editing in Latent Space

Nov 22, 2022
Via arXiv