Yanwei Li

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Mar 27, 2024
Yanwei Li, Yuechen Zhang, Chengyao Wang, Zhisheng Zhong, Yixin Chen, Ruihang Chu, Shaoteng Liu, Jiaya Jia

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

Feb 29, 2024
Shaoteng Liu, Haoqi Yuan, Minda Hu, Yanwei Li, Yukang Chen, Shu Liu, Zongqing Lu, Jiaya Jia

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Nov 28, 2023
Yanwei Li, Chengyao Wang, Jiaya Jia

LISA: Reasoning Segmentation via Large Language Model

Aug 03, 2023
Xin Lai, Zhuotao Tian, Yukang Chen, Yanwei Li, Yuhui Yuan, Shu Liu, Jiaya Jia

Democratizing Pathological Image Segmentation with Lay Annotators via Molecular-empowered Learning

May 31, 2023
Ruining Deng, Yanwei Li, Peize Li, Jiacheng Wang, Lucas W. Remedios, Saydolimkhon Agzamkhodjaev, Zuhayr Asad, Quan Liu, Can Cui, Yucheng Tang, Haichun Yang, Yuankai Huo

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction

May 30, 2023
Rui Yang, Lin Song, Yanwei Li, Sijie Zhao, Yixiao Ge, Xiu Li, Ying Shan

Diversified Dynamic Routing for Vision Tasks

Sep 26, 2022
Botos Csaba, Adel Bibi, Yanwei Li, Philip Torr, Ser-Nam Lim

Unifying Voxel-based Representation with Transformer for 3D Object Detection

Jun 01, 2022
Yanwei Li, Yilun Chen, Xiaojuan Qi, Zeming Li, Jian Sun, Jiaya Jia

Voxel Field Fusion for 3D Object Detection

May 31, 2022
Yanwei Li, Xiaojuan Qi, Yukang Chen, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia
