Alert button
Picture for Yuzhong Zhao

Yuzhong Zhao

Alert button

Controllable Dense Captioner with Multimodal Embedding Bridging

Add code
Bookmark button
Alert button
Feb 01, 2024
Yuzhong Zhao, Yue Liu, Zonghao Guo, Weijia Wu, Chen Gong, Fang Wan, Qixiang Ye

Viaarxiv icon

VMamba: Visual State Space Model

Add code
Bookmark button
Alert button
Jan 18, 2024
Yue Liu, Yunjie Tian, Yuzhong Zhao, Hongtian Yu, Lingxi Xie, Yaowei Wang, Qixiang Ye, Yunfan Liu

Viaarxiv icon

Continual Learning for Image Segmentation with Dynamic Query

Add code
Bookmark button
Alert button
Nov 29, 2023
Weijia Wu, Yuzhong Zhao, Zhuang Li, Lianlei Shan, Hong Zhou, Mike Zheng Shou

Viaarxiv icon

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

Add code
Bookmark button
Alert button
Aug 11, 2023
Weijia Wu, Yuzhong Zhao, Hao Chen, Yuchao Gu, Rui Zhao, Yefei He, Hong Zhou, Mike Zheng Shou, Chunhua Shen

Figure 1 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Figure 2 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Figure 3 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Figure 4 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Viaarxiv icon

Generative Prompt Model for Weakly Supervised Object Localization

Add code
Bookmark button
Alert button
Jul 19, 2023
Yuzhong Zhao, Qixiang Ye, Weijia Wu, Chunhua Shen, Fang Wan

Figure 1 for Generative Prompt Model for Weakly Supervised Object Localization
Figure 2 for Generative Prompt Model for Weakly Supervised Object Localization
Figure 3 for Generative Prompt Model for Weakly Supervised Object Localization
Figure 4 for Generative Prompt Model for Weakly Supervised Object Localization
Viaarxiv icon

A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension

Add code
Bookmark button
Alert button
May 05, 2023
Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Hong Zhou, Mike Zheng Shou, Xiang Bai

Figure 1 for A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Figure 2 for A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Figure 3 for A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Figure 4 for A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension
Viaarxiv icon

FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation

Add code
Bookmark button
Alert button
May 05, 2023
Yuzhong Zhao, Weijia Wu, Zhuang Li, Jiahong Li, Weiqiang Wang

Figure 1 for FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
Figure 2 for FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
Figure 3 for FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
Figure 4 for FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
Viaarxiv icon

ICDAR 2023 Video Text Reading Competition for Dense and Small Text

Add code
Bookmark button
Alert button
Apr 10, 2023
Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai

Figure 1 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Figure 2 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Figure 3 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Figure 4 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Viaarxiv icon

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models

Add code
Bookmark button
Alert button
Mar 21, 2023
Weijia Wu, Yuzhong Zhao, Mike Zheng Shou, Hong Zhou, Chunhua Shen

Figure 1 for DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Figure 2 for DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Figure 3 for DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Figure 4 for DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
Viaarxiv icon

Explore Faster Localization Learning For Scene Text Detection

Add code
Bookmark button
Alert button
Jul 04, 2022
Yuzhong Zhao, Yuanqiang Cai, Weijia Wu, Weiqiang Wang

Figure 1 for Explore Faster Localization Learning For Scene Text Detection
Figure 2 for Explore Faster Localization Learning For Scene Text Detection
Figure 3 for Explore Faster Localization Learning For Scene Text Detection
Figure 4 for Explore Faster Localization Learning For Scene Text Detection
Viaarxiv icon