Shijian Lu

Nanyang Technological University

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

Jun 18, 2024

MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era

Jun 13, 2024

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

May 13, 2024

Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders

May 02, 2024

MixLight: Borrowing the Best of both Spherical Harmonics and Gaussian Models

Apr 19, 2024

Efficient Test-Time Adaptation of Vision-Language Models

Mar 27, 2024

Masked AutoDecoder is Effective Multi-Task Vision Generalist

Mar 14, 2024

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

Mar 12, 2024

FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization

Mar 11, 2024

Weakly Supervised Monocular 3D Detection with a Single-View Image

Feb 29, 2024