Picture for Mojtaba Seyedhosseini

Mojtaba Seyedhosseini

Gemini Embedding: Generalizable Embeddings from Gemini

Add code
Mar 10, 2025
Viaarxiv icon

TIPS: Text-Image Pretraining with Spatial Awareness

Add code
Oct 21, 2024
Figure 1 for TIPS: Text-Image Pretraining with Spatial Awareness
Figure 2 for TIPS: Text-Image Pretraining with Spatial Awareness
Figure 3 for TIPS: Text-Image Pretraining with Spatial Awareness
Figure 4 for TIPS: Text-Image Pretraining with Spatial Awareness
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Improve Supervised Representation Learning with Masked Image Modeling

Add code
Dec 01, 2023
Figure 1 for Improve Supervised Representation Learning with Masked Image Modeling
Figure 2 for Improve Supervised Representation Learning with Masked Image Modeling
Figure 3 for Improve Supervised Representation Learning with Masked Image Modeling
Figure 4 for Improve Supervised Representation Learning with Masked Image Modeling
Viaarxiv icon

Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations

Add code
Sep 04, 2023
Viaarxiv icon

PaLI-X: On Scaling up a Multilingual Vision and Language Model

Add code
May 29, 2023
Figure 1 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 2 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 3 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 4 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Viaarxiv icon

PaLI: A Jointly-Scaled Multilingual Language-Image Model

Add code
Sep 16, 2022
Figure 1 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Figure 2 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Figure 3 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Figure 4 for PaLI: A Jointly-Scaled Multilingual Language-Image Model
Viaarxiv icon

CoCa: Contrastive Captioners are Image-Text Foundation Models

Add code
May 04, 2022
Figure 1 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 2 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 3 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 4 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Viaarxiv icon

Image Segmentation Using Hierarchical Merge Tree

Add code
Jul 31, 2016
Figure 1 for Image Segmentation Using Hierarchical Merge Tree
Figure 2 for Image Segmentation Using Hierarchical Merge Tree
Figure 3 for Image Segmentation Using Hierarchical Merge Tree
Figure 4 for Image Segmentation Using Hierarchical Merge Tree
Viaarxiv icon

Disjunctive Normal Networks

Add code
Dec 30, 2014
Figure 1 for Disjunctive Normal Networks
Figure 2 for Disjunctive Normal Networks
Figure 3 for Disjunctive Normal Networks
Figure 4 for Disjunctive Normal Networks
Viaarxiv icon