Bernt Schiele

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Mar 14, 2024

Adaptive Hierarchical Certification for Segmentation using Randomized Smoothing

Feb 13, 2024

Good Teachers Explain: Explanation-Enhanced Knowledge Distillation

Feb 05, 2024

Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

Jan 07, 2024

SSB: Simple but Strong Baseline for Boosting Performance of Open-Set Semi-Supervised Learning

Nov 17, 2023

Wakening Past Concepts without Past Data: Class-Incremental Learning from Online Placebos

Oct 24, 2023

HowToCaption: Prompting LLMs to Transform Video Annotations at Scale

Oct 07, 2023

DARTH: Holistic Test-time Adaptation for Multiple Object Tracking

Oct 03, 2023

Unsupervised Open-Vocabulary Object Localization in Videos

Sep 18, 2023

In-Style: Bridging Text and Uncurated Videos with Style Transfer for Text-Video Retrieval

Sep 16, 2023