Visualbert


Visual Question Answering on Multiple Remote Sensing Image Modalities

Add code
May 21, 2025
Viaarxiv icon

Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes

Add code
Oct 17, 2024
Figure 1 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Figure 2 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Figure 3 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Figure 4 for Seeing Through VisualBERT: A Causal Adventure on Memetic Landscapes
Viaarxiv icon

OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst

Add code
Jun 14, 2024
Figure 1 for OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
Figure 2 for OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
Viaarxiv icon

Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking

Add code
Jan 29, 2024
Viaarxiv icon

A Review of Vision-Language Models and their Performance on the Hateful Memes Challenge

Add code
May 09, 2023
Viaarxiv icon

Controlling for Stereotypes in Multimodal Language Model Evaluation

Add code
Feb 03, 2023
Viaarxiv icon

Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision

Add code
Oct 27, 2022
Figure 1 for Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision
Figure 2 for Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision
Figure 3 for Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision
Figure 4 for Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision
Viaarxiv icon

Transfer Learning with Joint Fine-Tuning for Multimodal Sentiment Analysis

Add code
Oct 11, 2022
Figure 1 for Transfer Learning with Joint Fine-Tuning for Multimodal Sentiment Analysis
Figure 2 for Transfer Learning with Joint Fine-Tuning for Multimodal Sentiment Analysis
Viaarxiv icon

Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing

Add code
Oct 10, 2022
Figure 1 for Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing
Figure 2 for Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing
Figure 3 for Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing
Figure 4 for Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing
Viaarxiv icon

Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling

Add code
Sep 15, 2022
Figure 1 for Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling
Figure 2 for Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling
Figure 3 for Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling
Figure 4 for Knowledge Graph Completion with Pre-trained Multimodal Transformer and Twins Negative Sampling
Viaarxiv icon