Alert button
Picture for Cordelia Schmid

Cordelia Schmid

Alert button

TubeDETR: Spatio-Temporal Video Grounding with Transformers

Add code
Bookmark button
Alert button
Mar 30, 2022
Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid

Figure 1 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 2 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 3 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Figure 4 for TubeDETR: Spatio-Temporal Video Grounding with Transformers
Viaarxiv icon

Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems

Add code
Bookmark button
Alert button
Mar 11, 2022
Quentin Le Lidec, Louis Montaut, Cordelia Schmid, Ivan Laptev, Justin Carpentier

Figure 1 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Figure 2 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Figure 3 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Figure 4 for Leveraging Randomized Smoothing for Optimal Control of Nonsmooth Dynamical Systems
Viaarxiv icon

The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields

Add code
Bookmark button
Alert button
Feb 28, 2022
Pia Bideau, Erik Learned-Miller, Cordelia Schmid, Karteek Alahari

Figure 1 for The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields
Figure 2 for The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields
Figure 3 for The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields
Figure 4 for The Right Spin: Learning Object Motion from Rotation-Compensated Flow Fields
Viaarxiv icon

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Feb 23, 2022
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev

Figure 1 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Figure 2 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Figure 3 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Figure 4 for Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Viaarxiv icon

Learning with Neighbor Consistency for Noisy Labels

Add code
Bookmark button
Alert button
Feb 04, 2022
Ahmet Iscen, Jack Valmadre, Anurag Arnab, Cordelia Schmid

Figure 1 for Learning with Neighbor Consistency for Noisy Labels
Figure 2 for Learning with Neighbor Consistency for Noisy Labels
Figure 3 for Learning with Neighbor Consistency for Noisy Labels
Figure 4 for Learning with Neighbor Consistency for Noisy Labels
Viaarxiv icon

End-to-end Generative Pretraining for Multimodal Video Captioning

Add code
Bookmark button
Alert button
Jan 20, 2022
Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid

Figure 1 for End-to-end Generative Pretraining for Multimodal Video Captioning
Figure 2 for End-to-end Generative Pretraining for Multimodal Video Captioning
Figure 3 for End-to-end Generative Pretraining for Multimodal Video Captioning
Figure 4 for End-to-end Generative Pretraining for Multimodal Video Captioning
Viaarxiv icon

Multiview Transformers for Video Recognition

Add code
Bookmark button
Alert button
Jan 20, 2022
Shen Yan, Xuehan Xiong, Anurag Arnab, Zhichao Lu, Mi Zhang, Chen Sun, Cordelia Schmid

Figure 1 for Multiview Transformers for Video Recognition
Figure 2 for Multiview Transformers for Video Recognition
Figure 3 for Multiview Transformers for Video Recognition
Figure 4 for Multiview Transformers for Video Recognition
Viaarxiv icon

Masking Modalities for Cross-modal Video Retrieval

Add code
Bookmark button
Alert button
Nov 03, 2021
Valentin Gabeur, Arsha Nagrani, Chen Sun, Karteek Alahari, Cordelia Schmid

Figure 1 for Masking Modalities for Cross-modal Video Retrieval
Figure 2 for Masking Modalities for Cross-modal Video Retrieval
Figure 3 for Masking Modalities for Cross-modal Video Retrieval
Figure 4 for Masking Modalities for Cross-modal Video Retrieval
Viaarxiv icon