Alert button
Picture for Shutao Li

Shutao Li

Alert button

Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation

Mar 20, 2024
Linshan Wu, Zhun Zhong, Jiayi Ma, Yunchao Wei, Hao Chen, Leyuan Fang, Shutao Li

Viaarxiv icon

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

Feb 04, 2024
Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Ma

Viaarxiv icon

Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition

Oct 16, 2023
Jun Zhang, Lipeng Zhu, Chao Wang, Shutao Li

Figure 1 for Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition
Figure 2 for Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition
Figure 3 for Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition
Figure 4 for Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition
Viaarxiv icon

VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation

Jun 11, 2023
Xu Zhang, Kailun Yang, Jiacheng Lin, Jin Yuan, Zhiyong Li, Shutao Li

Figure 1 for VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation
Figure 2 for VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation
Figure 3 for VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation
Figure 4 for VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation
Viaarxiv icon

AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation

May 07, 2023
Jiacheng Lin, Jiajun Chen, Kailun Yang, Alina Roitberg, Siyu Li, Zhiyong Li, Shutao Li

Figure 1 for AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation
Figure 2 for AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation
Figure 3 for AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation
Figure 4 for AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation
Viaarxiv icon

LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition

May 05, 2023
Fuyan Ma, Bin Sun, Shutao Li

Figure 1 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Figure 2 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Figure 3 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Figure 4 for LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition
Viaarxiv icon

Learning to Locate Visual Answer in Video Corpus Using Question

Oct 11, 2022
Bin Li, Yixuan Weng, Bin Sun, Shutao Li

Figure 1 for Learning to Locate Visual Answer in Video Corpus Using Question
Figure 2 for Learning to Locate Visual Answer in Video Corpus Using Question
Figure 3 for Learning to Locate Visual Answer in Video Corpus Using Question
Figure 4 for Learning to Locate Visual Answer in Video Corpus Using Question
Viaarxiv icon

Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation

Jul 05, 2022
Bin Li, Yixuan Weng, Ziyu Ma, Bin Sun, Shutao Li

Figure 1 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Figure 2 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Figure 3 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Figure 4 for Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation
Viaarxiv icon

Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild

May 10, 2022
Fuyan Ma, Bin Sun, Shutao Li

Figure 1 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Figure 2 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Figure 3 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Figure 4 for Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild
Viaarxiv icon

LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs

Apr 20, 2022
Fei Xia, Bin Li, Yixuan Weng, Shizhu He, Kang Liu, Bin Sun, Shutao Li, Jun Zhao

Figure 1 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Figure 2 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Figure 3 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Figure 4 for LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs
Viaarxiv icon