Alert button
Picture for Wenliang Dai

Wenliang Dai

Alert button

Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models

Oct 09, 2023
Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya, Ziwei Ji, Pascale Fung

Viaarxiv icon

Survey of Social Bias in Vision-Language Models

Sep 24, 2023
Nayeon Lee, Yejin Bang, Holy Lovenia, Samuel Cahyawijaya, Wenliang Dai, Pascale Fung

Viaarxiv icon

Visual Instruction Tuning with Polite Flamingo

Jul 03, 2023
Delong Chen, Jianfeng Liu, Wenliang Dai, Baoyuan Wang

Figure 1 for Visual Instruction Tuning with Polite Flamingo
Figure 2 for Visual Instruction Tuning with Polite Flamingo
Figure 3 for Visual Instruction Tuning with Polite Flamingo
Figure 4 for Visual Instruction Tuning with Polite Flamingo
Viaarxiv icon

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

May 11, 2023
Wenliang Dai, Junnan Li, Dongxu Li, Anthony Meng Huat Tiong, Junqi Zhao, Weisheng Wang, Boyang Li, Pascale Fung, Steven Hoi

Figure 1 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 2 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 3 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Figure 4 for InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Viaarxiv icon

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

Feb 28, 2023
Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu, Pascale Fung

Figure 1 for A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Figure 2 for A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Figure 3 for A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Figure 4 for A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Viaarxiv icon

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Dec 20, 2022
Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti

Figure 1 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 2 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 3 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 4 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Viaarxiv icon

Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training

Oct 14, 2022
Wenliang Dai, Zihan Liu, Ziwei Ji, Dan Su, Pascale Fung

Figure 1 for Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Figure 2 for Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Figure 3 for Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Figure 4 for Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Viaarxiv icon

Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands

Jul 06, 2022
Wenliang Dai, Samuel Cahyawijaya, Tiezheng Yu, Elham J Barezi, Pascale Fung

Figure 1 for Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands
Figure 2 for Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands
Figure 3 for Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands
Figure 4 for Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands
Viaarxiv icon

Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation

Mar 30, 2022
Wenliang Dai, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Pascale Fung

Figure 1 for Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
Figure 2 for Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
Figure 3 for Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
Figure 4 for Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
Viaarxiv icon