Kentaro Takemoto

D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

Sep 15, 2023

HICO-DET-SG and V-COCO-SG: New Data Splits to Evaluate Systematic Generalization in Human-Object Interaction Detection

May 17, 2023

Transformer Module Networks for Systematic Generalization in Visual Question Answering

Jan 27, 2022

Multimodal Explanations by Predicting Counterfactuality in Videos

Dec 04, 2018