Picture for Ekaterina Deyneka

Ekaterina Deyneka

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Add code
Feb 29, 2024
Figure 1 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Figure 2 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Figure 3 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Figure 4 for Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Viaarxiv icon

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Add code
Feb 22, 2024
Figure 1 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 2 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 3 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 4 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Viaarxiv icon