Alert button
Picture for Xiaohui Shen

Xiaohui Shen

Alert button

MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Nov 30, 2023
Ju He, Qihang Yu, Inkyu Shin, Xueqing Deng, Xiaohui Shen, Alan Yuille, Liang-Chieh Chen

Viaarxiv icon

Towards Open-Ended Visual Recognition with Large Language Model

Nov 14, 2023
Qihang Yu, Xiaohui Shen, Liang-Chieh Chen

Viaarxiv icon

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Aug 04, 2023
Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, Liang-Chieh Chen

Figure 1 for Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Figure 2 for Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Figure 3 for Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Figure 4 for Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
Viaarxiv icon

$R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition

Apr 06, 2023
Sijie Zhu, Linjie Yang, Chen Chen, Mubarak Shah, Xiaohui Shen, Heng Wang

Figure 1 for $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition
Figure 2 for $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition
Figure 3 for $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition
Figure 4 for $R^{2}$Former: Unified $R$etrieval and $R$eranking Transformer for Place Recognition
Viaarxiv icon

Multimodal Video Adapter for Parameter Efficient Video Text Retrieval

Jan 19, 2023
Bowen Zhang, Xiaojie Jin, Weibo Gong, Kai Xu, Zhao Zhang, Peng Wang, Xiaohui Shen, Jiashi Feng

Figure 1 for Multimodal Video Adapter for Parameter Efficient Video Text Retrieval
Figure 2 for Multimodal Video Adapter for Parameter Efficient Video Text Retrieval
Figure 3 for Multimodal Video Adapter for Parameter Efficient Video Text Retrieval
Figure 4 for Multimodal Video Adapter for Parameter Efficient Video Text Retrieval
Viaarxiv icon

Contrastive Masked Autoencoders are Stronger Vision Learners

Jul 27, 2022
Zhicheng Huang, Xiaojie Jin, Chengze Lu, Qibin Hou, Ming-Ming Cheng, Dongmei Fu, Xiaohui Shen, Jiashi Feng

Figure 1 for Contrastive Masked Autoencoders are Stronger Vision Learners
Figure 2 for Contrastive Masked Autoencoders are Stronger Vision Learners
Figure 3 for Contrastive Masked Autoencoders are Stronger Vision Learners
Figure 4 for Contrastive Masked Autoencoders are Stronger Vision Learners
Viaarxiv icon

SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing

Dec 07, 2021
Yichun Shi, Xiao Yang, Yangyue Wan, Xiaohui Shen

Figure 1 for SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
Figure 2 for SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
Figure 3 for SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
Figure 4 for SemanticStyleGAN: Learning Compositional Generative Priors for Controllable Image Synthesis and Editing
Viaarxiv icon

Video Salient Object Detection via Contrastive Features and Attention Modules

Nov 03, 2021
Yi-Wen Chen, Xiaojie Jin, Xiaohui Shen, Ming-Hsuan Yang

Figure 1 for Video Salient Object Detection via Contrastive Features and Attention Modules
Figure 2 for Video Salient Object Detection via Contrastive Features and Attention Modules
Figure 3 for Video Salient Object Detection via Contrastive Features and Attention Modules
Figure 4 for Video Salient Object Detection via Contrastive Features and Attention Modules
Viaarxiv icon

Adversarial Open Domain Adaption for Sketch-to-Photo Synthesis

Apr 12, 2021
Xiaoyu Xiang, Ding Liu, Xiao Yang, Yiheng Zhu, Xiaohui Shen, Jan P. Allebach

Figure 1 for Adversarial Open Domain Adaption for Sketch-to-Photo Synthesis
Figure 2 for Adversarial Open Domain Adaption for Sketch-to-Photo Synthesis
Figure 3 for Adversarial Open Domain Adaption for Sketch-to-Photo Synthesis
Figure 4 for Adversarial Open Domain Adaption for Sketch-to-Photo Synthesis
Viaarxiv icon

DynOcc: Learning Single-View Depth from Dynamic Occlusion Cues

Mar 30, 2021
Yifan Wang, Linjie Luo, Xiaohui Shen, Xing Mei

Figure 1 for DynOcc: Learning Single-View Depth from Dynamic Occlusion Cues
Figure 2 for DynOcc: Learning Single-View Depth from Dynamic Occlusion Cues
Figure 3 for DynOcc: Learning Single-View Depth from Dynamic Occlusion Cues
Figure 4 for DynOcc: Learning Single-View Depth from Dynamic Occlusion Cues
Viaarxiv icon