Zhuowan Li

Johns Hopkins University

Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA

Mar 28, 2024
Zhuowan Li, Bhavan Jasani, Peng Tang, Shabnam Ghadar

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

Dec 09, 2023
Shitian Zhao, Zhuowan Li, Yadong Lu, Alan Yuille, Yan Wang

3D-Aware Visual Question Answering about Parts, Poses and Occlusions

Oct 27, 2023
Xingrui Wang, Wufei Ma, Zhuowan Li, Adam Kortylewski, Alan Yuille

Localization vs. Semantics: How Can Language Benefit Visual Representation Learning?

Dec 01, 2022
Zhuowan Li, Cihang Xie, Benjamin Van Durme, Alan Yuille

Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning

Dec 01, 2022
Zhuowan Li, Xingrui Wang, Elias Stengel-Eskin, Adam Kortylewski, Wufei Ma, Benjamin Van Durme, Alan Yuille

Visual Commonsense in Pretrained Unimodal and Multimodal Models

May 04, 2022
Chenyu Zhang, Benjamin Van Durme, Zhuowan Li, Elias Stengel-Eskin

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering

Apr 05, 2022
Vipul Gupta, Zhuowan Li, Adam Kortylewski, Chenyu Zhang, Yingwei Li, Alan Yuille

Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images

Oct 01, 2021
Zhuowan Li, Elias Stengel-Eskin, Yixiao Zhang, Cihang Xie, Quan Tran, Benjamin Van Durme, Alan Yuille

Context-Aware Group Captioning via Self-Attention and Contrastive Features

Apr 07, 2020
Zhuowan Li, Quan Tran, Long Mai, Zhe Lin, Alan Yuille
