Alert button
Picture for Yiwu Zhong

Yiwu Zhong

Alert button

Towards Learning a Generalist Model for Embodied Navigation

Dec 06, 2023
Duo Zheng, Shijia Huang, Lin Zhao, Yiwu Zhong, Liwei Wang

Viaarxiv icon

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Nov 13, 2023
An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang

Viaarxiv icon

Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models

Oct 04, 2023
An Yan, Yu Wang, Yiwu Zhong, Zexue He, Petros Karypis, Zihan Wang, Chengyu Dong, Amilcare Gentili, Chun-Nan Hsu, Jingbo Shang, Julian McAuley

Figure 1 for Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
Figure 2 for Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
Figure 3 for Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
Figure 4 for Robust and Interpretable Medical Image Classifiers via Concept Bottleneck Models
Viaarxiv icon

Learning Concise and Descriptive Attributes for Visual Recognition

Aug 07, 2023
An Yan, Yu Wang, Yiwu Zhong, Chengyu Dong, Zexue He, Yujie Lu, William Wang, Jingbo Shang, Julian McAuley

Figure 1 for Learning Concise and Descriptive Attributes for Visual Recognition
Figure 2 for Learning Concise and Descriptive Attributes for Visual Recognition
Figure 3 for Learning Concise and Descriptive Attributes for Visual Recognition
Figure 4 for Learning Concise and Descriptive Attributes for Visual Recognition
Viaarxiv icon

Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations

Mar 31, 2023
Yiwu Zhong, Licheng Yu, Yang Bai, Shangwen Li, Xueting Yan, Yin Li

Figure 1 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Figure 2 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Figure 3 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Figure 4 for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Viaarxiv icon

RegionCLIP: Region-based Language-Image Pretraining

Dec 16, 2021
Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao

Figure 1 for RegionCLIP: Region-based Language-Image Pretraining
Figure 2 for RegionCLIP: Region-based Language-Image Pretraining
Figure 3 for RegionCLIP: Region-based Language-Image Pretraining
Figure 4 for RegionCLIP: Region-based Language-Image Pretraining
Viaarxiv icon

Grounded Language-Image Pre-training

Dec 07, 2021
Liunian Harold Li, Pengchuan Zhang, Haotian Zhang, Jianwei Yang, Chunyuan Li, Yiwu Zhong, Lijuan Wang, Lu Yuan, Lei Zhang, Jenq-Neng Hwang, Kai-Wei Chang, Jianfeng Gao

Figure 1 for Grounded Language-Image Pre-training
Figure 2 for Grounded Language-Image Pre-training
Figure 3 for Grounded Language-Image Pre-training
Figure 4 for Grounded Language-Image Pre-training
Viaarxiv icon

Learning to Generate Scene Graph from Natural Language Supervision

Sep 06, 2021
Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li

Figure 1 for Learning to Generate Scene Graph from Natural Language Supervision
Figure 2 for Learning to Generate Scene Graph from Natural Language Supervision
Figure 3 for Learning to Generate Scene Graph from Natural Language Supervision
Figure 4 for Learning to Generate Scene Graph from Natural Language Supervision
Viaarxiv icon

Comprehensive Image Captioning via Scene Graph Decomposition

Jul 23, 2020
Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li

Figure 1 for Comprehensive Image Captioning via Scene Graph Decomposition
Figure 2 for Comprehensive Image Captioning via Scene Graph Decomposition
Figure 3 for Comprehensive Image Captioning via Scene Graph Decomposition
Figure 4 for Comprehensive Image Captioning via Scene Graph Decomposition
Viaarxiv icon