Wenbin An

Boosting SAM for Cross-Domain Few-Shot Segmentation via Conditional Point Sparsification

Feb 05, 2026
Via arXiv

E.M.Ground: A Temporal Grounding Vid-LLM with Holistic Event Perception and Matching

Feb 05, 2026
Via arXiv

Cross-Domain Few-Shot Segmentation via Multi-view Progressive Adaptation

Feb 05, 2026
Via arXiv

A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future

Dec 18, 2024
Via arXiv

Unleashing the Potential of Model Bias for Generalized Category Discovery

Dec 17, 2024
Via arXiv

Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing

Oct 24, 2024
Via arXiv

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery

Sep 29, 2024
Via arXiv

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models

Jul 22, 2024
Via arXiv

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Jun 18, 2024
Via arXiv

MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era

Jun 13, 2024
Via arXiv