Alert button
Picture for Jianwei Yang

Jianwei Yang

Alert button

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

Mar 20, 2024
Juan Manuel Zambrano Chaves, Shih-Cheng Huang, Yanbo Xu, Hanwen Xu, Naoto Usuyama, Sheng Zhang, Fei Wang, Yujia Xie, Mahmoud Khademi, Ziyi Yang, Hany Awadalla, Julia Gong, Houdong Hu, Jianwei Yang, Chunyuan Li, Jianfeng Gao, Yu Gu, Cliff Wong, Mu Wei, Tristan Naumann, Muhao Chen, Matthew P. Lungren, Serena Yeung-Levy, Curtis P. Langlotz, Sheng Wang, Hoifung Poon

Viaarxiv icon

Pix2Gif: Motion-Guided Diffusion for GIF Generation

Mar 08, 2024
Hitesh Kandala, Jianfeng Gao, Jianwei Yang

Figure 1 for Pix2Gif: Motion-Guided Diffusion for GIF Generation
Figure 2 for Pix2Gif: Motion-Guided Diffusion for GIF Generation
Figure 3 for Pix2Gif: Motion-Guided Diffusion for GIF Generation
Figure 4 for Pix2Gif: Motion-Guided Diffusion for GIF Generation
Viaarxiv icon

Foundation Models for Biomedical Image Segmentation: A Survey

Jan 15, 2024
Ho Hin Lee, Yu Gu, Theodore Zhao, Yanbo Xu, Jianwei Yang, Naoto Usuyama, Cliff Wong, Mu Wei, Bennett A. Landman, Yuankai Huo, Alberto Santamaria-Pang, Hoifung Poon

Viaarxiv icon

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Dec 21, 2023
Jitesh Jain, Jianwei Yang, Humphrey Shi

Viaarxiv icon

Interfacing Foundation Models' Embeddings

Dec 12, 2023
Xueyan Zou, Linjie Li, Jianfeng Wang, Jianwei Yang, Mingyu Ding, Zhengyuan Yang, Feng Li, Hao Zhang, Shilong Liu, Arul Aravinthan, Yong Jae Lee, Lijuan Wang

Viaarxiv icon

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

Dec 05, 2023
Hao Zhang, Hongyang Li, Feng Li, Tianhe Ren, Xueyan Zou, Shilong Liu, Shijia Huang, Jianfeng Gao, Lei Zhang, Chunyuan Li, Jianwei Yang

Viaarxiv icon

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Dec 04, 2023
Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang

Figure 1 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 2 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 3 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Figure 4 for IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Viaarxiv icon

Visual In-Context Prompting

Nov 22, 2023
Feng Li, Qing Jiang, Hao Zhang, Tianhe Ren, Shilong Liu, Xueyan Zou, Huaizhe Xu, Hongyang Li, Chunyuan Li, Jianwei Yang, Lei Zhang, Jianfeng Gao

Viaarxiv icon

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation

Nov 13, 2023
An Yan, Zhengyuan Yang, Wanrong Zhu, Kevin Lin, Linjie Li, Jianfeng Wang, Jianwei Yang, Yiwu Zhong, Julian McAuley, Jianfeng Gao, Zicheng Liu, Lijuan Wang

Viaarxiv icon