Cordelia Schmid

MoReVQA: Exploring Modular Reasoning Models for Video Question Answering

Apr 09, 2024
Juhong Min, Shyamal Buch, Arsha Nagrani, Minsu Cho, Cordelia Schmid

Via arXiv

Learning Correlation Structures for Vision Transformers

Apr 05, 2024
Manjin Kim, Paul Hongsuck Seo, Cordelia Schmid, Minsu Cho

Via arXiv

SUGAR: Pre-training 3D Visual Representations for Robotics

Apr 01, 2024
Shizhe Chen, Ricardo Garcia, Ivan Laptev, Cordelia Schmid

Via arXiv

Streaming Dense Video Captioning

Apr 01, 2024
Xingyi Zhou, Anurag Arnab, Shyamal Buch, Shen Yan, Austin Myers, Xuehan Xiong, Arsha Nagrani, Cordelia Schmid

Via arXiv

A Generative Approach for Wikipedia-Scale Visual Entity Recognition

Mar 04, 2024
Mathilde Caron, Ahmet Iscen, Alireza Fathi, Cordelia Schmid

Via arXiv

SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code

Mar 02, 2024
Ziniu Hu, Ahmet Iscen, Aashi Jain, Thomas Kipf, Yisong Yue, David A. Ross, Cordelia Schmid, Alireza Fathi

Via arXiv

Time-, Memory- and Parameter-Efficient Visual Adaptation

Feb 05, 2024
Otniel-Bogdan Mercea, Alexey Gritsenko, Cordelia Schmid, Anurag Arnab

Via arXiv

RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks

Jan 11, 2024
Partha Ghosh, Soubhik Sanyal, Cordelia Schmid, Bernhard Schölkopf

Via arXiv

Pixel Aligned Language Models

Dec 14, 2023
Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid

Via arXiv

Dense Optical Tracking: Connecting the Dots

Dec 07, 2023
Guillaume Le Moing, Jean Ponce, Cordelia Schmid

Via arXiv