Xiaoqin Zhang

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection

Jul 17, 2024
Via arXiv

CorrMAE: Pre-training Correspondence Transformers with Masked Autoencoder

Jun 09, 2024
Via arXiv

One-shot Training for Video Object Segmentation

May 22, 2024
Via arXiv

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

May 13, 2024
Via arXiv

Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

Apr 18, 2024
Via arXiv

Masked AutoDecoder is Effective Multi-Task Vision Generalist

Mar 14, 2024
Via arXiv

Weakly Supervised Monocular 3D Detection with a Single-View Image

Feb 29, 2024
Via arXiv

CAT-SAM: Conditional Tuning Network for Few-Shot Adaptation of Segmentation Anything Model

Feb 06, 2024
Via arXiv

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

Jan 04, 2024
Via arXiv

High-Order Tensor Recovery with A Tensor $U_1$ Norm

Nov 23, 2023
Via arXiv