Sheng Jin

TCFormer: Visual Recognition via Token Clustering Transformer

Jul 16, 2024
Via arXiv

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Jul 14, 2024
Via arXiv

F-LMM: Grounding Frozen Large Multimodal Models

Jun 09, 2024
Via arXiv

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

May 13, 2024
Via arXiv

UniFS: Universal Few-shot Instance Perception with Point Representations

Apr 30, 2024
Via arXiv

Weakly Supervised Monocular 3D Detection with a Single-View Image

Feb 29, 2024
Via arXiv

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks

Feb 23, 2024
Via arXiv

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Feb 07, 2024
Via arXiv

CLIM: Contrastive Language-Image Mosaic for Region Representation

Dec 19, 2023
Via arXiv

MCFNet: Multi-scale Covariance Feature Fusion Network for Real-time Semantic Segmentation

Dec 12, 2023
Via arXiv