Picture for Shijian Lu

Shijian Lu

Nanyang Technological University

Multimodal 3D Reasoning Segmentation with Complex Scenes

Add code
Nov 21, 2024
Viaarxiv icon

Open-Vocabulary Object Detection via Language Hierarchy

Add code
Oct 27, 2024
Figure 1 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 2 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 3 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 4 for Open-Vocabulary Object Detection via Language Hierarchy
Viaarxiv icon

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Figure 1 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 2 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 3 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 4 for Historical Test-time Prompt Tuning for Vision Foundation Models
Viaarxiv icon

Foundation Models for Remote Sensing and Earth Observation: A Survey

Add code
Oct 22, 2024
Viaarxiv icon

Mitigating Object Hallucination via Concentric Causal Attention

Add code
Oct 21, 2024
Viaarxiv icon

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Add code
Oct 16, 2024
Figure 1 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Figure 2 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Figure 3 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Figure 4 for The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Viaarxiv icon

LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models

Add code
Oct 15, 2024
Figure 1 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 2 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 3 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Figure 4 for LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models
Viaarxiv icon

Segment Anything with Multiple Modalities

Add code
Aug 17, 2024
Figure 1 for Segment Anything with Multiple Modalities
Figure 2 for Segment Anything with Multiple Modalities
Figure 3 for Segment Anything with Multiple Modalities
Figure 4 for Segment Anything with Multiple Modalities
Viaarxiv icon

Unlearnable Examples Detection via Iterative Filtering

Add code
Aug 15, 2024
Figure 1 for Unlearnable Examples Detection via Iterative Filtering
Figure 2 for Unlearnable Examples Detection via Iterative Filtering
Figure 3 for Unlearnable Examples Detection via Iterative Filtering
Figure 4 for Unlearnable Examples Detection via Iterative Filtering
Viaarxiv icon

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Add code
Jun 18, 2024
Figure 1 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 2 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 3 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 4 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Viaarxiv icon