Alert button
Picture for Jiale Cao

Jiale Cao

Alert button

VFMM3D: Releasing the Potential of Image by Vision Foundation Model for Monocular 3D Object Detection

Add code
Bookmark button
Alert button
Apr 15, 2024
Bonan Ding, Jin Xie, Jing Nie, Jiale Cao

Viaarxiv icon

Implicit and Explicit Language Guidance for Diffusion-based Visual Perception

Add code
Bookmark button
Alert button
Apr 11, 2024
Hefeng Wang, Jiale Cao, Jin Xie, Aiping Yang, Yanwei Pang

Viaarxiv icon

SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior

Add code
Bookmark button
Alert button
Mar 29, 2024
Zhongrui Yu, Haoran Wang, Jinze Yang, Hanzhang Wang, Zeke Xie, Yunfeng Cai, Jiale Cao, Zhong Ji, Mingming Sun

Figure 1 for SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior
Figure 2 for SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior
Figure 3 for SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior
Figure 4 for SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior
Viaarxiv icon

CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation

Add code
Bookmark button
Alert button
Mar 19, 2024
Wenqi Zhu, Jiale Cao, Jin Xie, Shuangming Yang, Yanwei Pang

Figure 1 for CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Figure 2 for CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Figure 3 for CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Figure 4 for CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Viaarxiv icon

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

Add code
Bookmark button
Alert button
Nov 27, 2023
Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang

Figure 1 for SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Figure 2 for SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Figure 3 for SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Figure 4 for SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation
Viaarxiv icon

Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects

Add code
Bookmark button
Alert button
Sep 22, 2023
Feng Yan, Xiaoheng Jiang, Yang Lu, Lisha Cui, Shupan Li, Jiale Cao, Mingliang Xu, Dacheng Tao

Figure 1 for Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects
Figure 2 for Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects
Figure 3 for Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects
Figure 4 for Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects
Viaarxiv icon

CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation

Add code
Bookmark button
Alert button
Sep 22, 2023
Xiaoheng Jiang, Kaiyi Guo, Yang Lu, Feng Yan, Hao Liu, Jiale Cao, Mingliang Xu, Dacheng Tao

Figure 1 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Figure 2 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Figure 3 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Figure 4 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Viaarxiv icon

A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos

Add code
Bookmark button
Alert button
Sep 09, 2023
Chao Qin, Jiale Cao, Huazhu Fu, Rao Muhammad Anwer, Fahad Shahbaz Khan

Figure 1 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Figure 2 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Figure 3 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Figure 4 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Viaarxiv icon

DFormer: Diffusion-guided Transformer for Universal Image Segmentation

Add code
Bookmark button
Alert button
Jun 08, 2023
Hefeng Wang, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang

Figure 1 for DFormer: Diffusion-guided Transformer for Universal Image Segmentation
Figure 2 for DFormer: Diffusion-guided Transformer for Universal Image Segmentation
Figure 3 for DFormer: Diffusion-guided Transformer for Universal Image Segmentation
Figure 4 for DFormer: Diffusion-guided Transformer for Universal Image Segmentation
Viaarxiv icon

Transformer-based stereo-aware 3D object detection from binocular images

Add code
Bookmark button
Alert button
Apr 24, 2023
Hanqing Sun, Yanwei Pang, Jiale Cao, Jin Xie, Xuelong Li

Figure 1 for Transformer-based stereo-aware 3D object detection from binocular images
Figure 2 for Transformer-based stereo-aware 3D object detection from binocular images
Figure 3 for Transformer-based stereo-aware 3D object detection from binocular images
Figure 4 for Transformer-based stereo-aware 3D object detection from binocular images
Viaarxiv icon