Picture for Cordelia Schmid

Cordelia Schmid

Thoth

MoReVQA: Exploring Modular Reasoning Models for Video Question Answering

Add code
Apr 09, 2024
Viaarxiv icon

Learning Correlation Structures for Vision Transformers

Add code
Apr 05, 2024
Figure 1 for Learning Correlation Structures for Vision Transformers
Figure 2 for Learning Correlation Structures for Vision Transformers
Figure 3 for Learning Correlation Structures for Vision Transformers
Figure 4 for Learning Correlation Structures for Vision Transformers
Viaarxiv icon

SUGAR: Pre-training 3D Visual Representations for Robotics

Add code
Apr 01, 2024
Figure 1 for SUGAR: Pre-training 3D Visual Representations for Robotics
Figure 2 for SUGAR: Pre-training 3D Visual Representations for Robotics
Figure 3 for SUGAR: Pre-training 3D Visual Representations for Robotics
Figure 4 for SUGAR: Pre-training 3D Visual Representations for Robotics
Viaarxiv icon

Streaming Dense Video Captioning

Add code
Apr 01, 2024
Figure 1 for Streaming Dense Video Captioning
Figure 2 for Streaming Dense Video Captioning
Figure 3 for Streaming Dense Video Captioning
Figure 4 for Streaming Dense Video Captioning
Viaarxiv icon

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Add code
Mar 04, 2024
Viaarxiv icon

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Add code
Mar 02, 2024
Figure 1 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 2 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 3 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Figure 4 for SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
Viaarxiv icon

Time-, Memory- and Parameter-Efficient Visual Adaptation

Add code
Feb 05, 2024
Viaarxiv icon

RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks

Add code
Jan 11, 2024
Viaarxiv icon

Pixel Aligned Language Models

Add code
Dec 14, 2023
Viaarxiv icon

Dense Optical Tracking: Connecting the Dots

Add code
Dec 07, 2023
Figure 1 for Dense Optical Tracking: Connecting the Dots
Figure 2 for Dense Optical Tracking: Connecting the Dots
Figure 3 for Dense Optical Tracking: Connecting the Dots
Figure 4 for Dense Optical Tracking: Connecting the Dots
Viaarxiv icon