Jieru Mei

What If We Recaption Billions of Web Images with LLaMA-3?

Jun 12, 2024

Autoregressive Pretraining with Mamba in Vision

Jun 11, 2024

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context

Jun 08, 2024

Mamba-R: Vision Mamba ALSO Needs Registers

May 23, 2024

3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge

Mar 23, 2024

SPFormer: Enhancing Vision Transformer with Superpixel Representation

Jan 05, 2024

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Dec 21, 2023

Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

Dec 18, 2023

SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

Dec 04, 2023

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers

Oct 11, 2023