Jiaxing Huang

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

Oct 15, 2024
Via arXiv

A Survey on Evaluation of Multimodal Large Language Models

Aug 28, 2024
Via arXiv

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jul 20, 2024
Via arXiv

Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans

Mar 22, 2024
Via arXiv

Masked AutoDecoder is Effective Multi-Task Vision Generalist

Mar 14, 2024
Via arXiv

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Feb 07, 2024
Via arXiv

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception

Jan 13, 2024
Via arXiv

Domain Adaptation for Large-Vocabulary Object Detectors

Jan 13, 2024
Via arXiv

Learning to Prompt Segment Anything Models

Jan 09, 2024
Via arXiv

Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey

Dec 27, 2023
Via arXiv