Anja Hauth

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Dec 21, 2023

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Learning Audio-Video Modalities from Image Captions

Apr 01, 2022

Fast Task-Aware Architecture Inference

Feb 15, 2019