Picture for Geoff Brown

Geoff Brown

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling

Add code
Oct 18, 2023
Figure 1 for Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling
Figure 2 for Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling
Figure 3 for Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling
Figure 4 for Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling
Viaarxiv icon

WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset

Add code
May 09, 2023
Figure 1 for WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset
Figure 2 for WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset
Figure 3 for WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset
Viaarxiv icon

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

Add code
May 05, 2023
Figure 1 for A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
Figure 2 for A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
Figure 3 for A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
Figure 4 for A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
Viaarxiv icon