Picture for Georgios Tzimiropoulos

Georgios Tzimiropoulos

Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization

Add code
Dec 29, 2023
Figure 1 for Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Figure 2 for Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Figure 3 for Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Figure 4 for Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Viaarxiv icon

A Simple Baseline for Knowledge-Based Visual Question Answering

Add code
Oct 24, 2023
Figure 1 for A Simple Baseline for Knowledge-Based Visual Question Answering
Figure 2 for A Simple Baseline for Knowledge-Based Visual Question Answering
Figure 3 for A Simple Baseline for Knowledge-Based Visual Question Answering
Figure 4 for A Simple Baseline for Knowledge-Based Visual Question Answering
Viaarxiv icon

SimDETR: Simplifying self-supervised pretraining for DETR

Add code
Jul 28, 2023
Viaarxiv icon

HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces

Add code
Jul 20, 2023
Figure 1 for HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
Figure 2 for HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
Figure 3 for HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
Figure 4 for HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces
Viaarxiv icon

Black Box Few-Shot Adaptation for Vision-Language models

Add code
Apr 04, 2023
Figure 1 for Black Box Few-Shot Adaptation for Vision-Language models
Figure 2 for Black Box Few-Shot Adaptation for Vision-Language models
Figure 3 for Black Box Few-Shot Adaptation for Vision-Language models
Figure 4 for Black Box Few-Shot Adaptation for Vision-Language models
Viaarxiv icon

DivClust: Controlling Diversity in Deep Clustering

Add code
Apr 03, 2023
Viaarxiv icon

Part-based Face Recognition with Vision Transformers

Add code
Nov 30, 2022
Viaarxiv icon

FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training

Add code
Oct 10, 2022
Figure 1 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Figure 2 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Figure 3 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Figure 4 for FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
Viaarxiv icon

Variational prompt tuning improves generalization of vision-language models

Add code
Oct 05, 2022
Figure 1 for Variational prompt tuning improves generalization of vision-language models
Figure 2 for Variational prompt tuning improves generalization of vision-language models
Figure 3 for Variational prompt tuning improves generalization of vision-language models
Figure 4 for Variational prompt tuning improves generalization of vision-language models
Viaarxiv icon

Language-Aware Soft Prompting for Vision & Language Foundation Models

Add code
Oct 03, 2022
Figure 1 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Figure 2 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Figure 3 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Figure 4 for Language-Aware Soft Prompting for Vision & Language Foundation Models
Viaarxiv icon