Jianwei Yang

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Jun 17, 2024
Via arXiv

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

Jun 06, 2024
Via arXiv

Matryoshka Multimodal Models

May 27, 2024
Via arXiv

BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

May 21, 2024
Via arXiv

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Apr 25, 2024
Via arXiv

Efficient Modulation for Vision Networks

Mar 29, 2024
Via arXiv

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

Mar 20, 2024
Via arXiv

Pix2Gif: Motion-Guided Diffusion for GIF Generation

Mar 08, 2024
Via arXiv

Foundation Models for Biomedical Image Segmentation: A Survey

Jan 15, 2024
Via arXiv

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Dec 21, 2023
Via arXiv