Picture for Jiahao Nie

Jiahao Nie

Mamba-Adaptor: State Space Model Adaptor for Visual Recognition

Add code
May 19, 2025
Viaarxiv icon

Unleashing the Potential of Model Bias for Generalized Category Discovery

Add code
Dec 17, 2024
Viaarxiv icon

VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking

Add code
Aug 05, 2024
Viaarxiv icon

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models

Add code
Jul 22, 2024
Figure 1 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 2 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 3 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 4 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Viaarxiv icon

P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point Clouds

Add code
Jul 09, 2024
Viaarxiv icon

Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models

Add code
Jun 27, 2024
Figure 1 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Figure 2 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Figure 3 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Figure 4 for Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models
Viaarxiv icon

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Add code
Jun 18, 2024
Figure 1 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 2 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 3 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Figure 4 for AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
Viaarxiv icon

MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era

Add code
Jun 13, 2024
Figure 1 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Figure 2 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Figure 3 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Figure 4 for MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era
Viaarxiv icon

Color Space Learning for Cross-Color Person Re-Identification

Add code
May 15, 2024
Viaarxiv icon

Towards Category Unification of 3D Single Object Tracking on Point Clouds

Add code
Jan 20, 2024
Viaarxiv icon