Picture for Gregor Geigle

Gregor Geigle

Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model

Add code
Jan 09, 2025
Figure 1 for Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
Figure 2 for Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
Figure 3 for Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
Figure 4 for Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
Viaarxiv icon

Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?

Add code
Jun 20, 2024
Figure 1 for Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?
Figure 2 for Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?
Figure 3 for Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?
Figure 4 for Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?
Viaarxiv icon

African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification

Add code
Jun 20, 2024
Viaarxiv icon

High-Quality Image Restoration Following Human Instructions

Add code
Jan 31, 2024
Viaarxiv icon

mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs

Add code
Jul 13, 2023
Viaarxiv icon

Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations

Add code
Jun 14, 2023
Figure 1 for Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Figure 2 for Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Figure 3 for Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Figure 4 for Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
Viaarxiv icon

One does not fit all! On the Complementarity of Vision Encoders for Vision and Language Tasks

Add code
Oct 12, 2022
Viaarxiv icon

UKP-SQUARE: An Online Platform for Question Answering Research

Add code
Mar 28, 2022
Figure 1 for UKP-SQUARE: An Online Platform for Question Answering Research
Figure 2 for UKP-SQUARE: An Online Platform for Question Answering Research
Figure 3 for UKP-SQUARE: An Online Platform for Question Answering Research
Figure 4 for UKP-SQUARE: An Online Platform for Question Answering Research
Viaarxiv icon

xGQA: Cross-Lingual Visual Question Answering

Add code
Sep 13, 2021
Figure 1 for xGQA: Cross-Lingual Visual Question Answering
Figure 2 for xGQA: Cross-Lingual Visual Question Answering
Figure 3 for xGQA: Cross-Lingual Visual Question Answering
Figure 4 for xGQA: Cross-Lingual Visual Question Answering
Viaarxiv icon

TWEAC: Transformer with Extendable QA Agent Classifiers

Add code
Apr 14, 2021
Figure 1 for TWEAC: Transformer with Extendable QA Agent Classifiers
Figure 2 for TWEAC: Transformer with Extendable QA Agent Classifiers
Figure 3 for TWEAC: Transformer with Extendable QA Agent Classifiers
Figure 4 for TWEAC: Transformer with Extendable QA Agent Classifiers
Viaarxiv icon