Alert button
Picture for Xiaoyu Yue

Xiaoyu Yue

Alert button

OV-PARTS: Towards Open-Vocabulary Part Segmentation

Add code
Bookmark button
Alert button
Oct 08, 2023
Meng Wei, Xiaoyu Yue, Wenwei Zhang, Shu Kong, Xihui Liu, Jiangmiao Pang

Figure 1 for OV-PARTS: Towards Open-Vocabulary Part Segmentation
Figure 2 for OV-PARTS: Towards Open-Vocabulary Part Segmentation
Figure 3 for OV-PARTS: Towards Open-Vocabulary Part Segmentation
Figure 4 for OV-PARTS: Towards Open-Vocabulary Part Segmentation
Viaarxiv icon

Understanding Masked Autoencoders From a Local Contrastive Perspective

Add code
Bookmark button
Alert button
Oct 03, 2023
Xiaoyu Yue, Lei Bai, Meng Wei, Jiangmiao Pang, Xihui Liu, Luping Zhou, Wanli Ouyang

Figure 1 for Understanding Masked Autoencoders From a Local Contrastive Perspective
Figure 2 for Understanding Masked Autoencoders From a Local Contrastive Perspective
Figure 3 for Understanding Masked Autoencoders From a Local Contrastive Perspective
Figure 4 for Understanding Masked Autoencoders From a Local Contrastive Perspective
Viaarxiv icon

In Defense of Clip-based Video Relation Detection

Add code
Bookmark button
Alert button
Jul 18, 2023
Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Roger Zimmermann

Figure 1 for In Defense of Clip-based Video Relation Detection
Figure 2 for In Defense of Clip-based Video Relation Detection
Figure 3 for In Defense of Clip-based Video Relation Detection
Figure 4 for In Defense of Clip-based Video Relation Detection
Viaarxiv icon

Rethinking the Two-Stage Framework for Grounded Situation Recognition

Add code
Bookmark button
Alert button
Dec 10, 2021
Meng Wei, Long Chen, Wei Ji, Xiaoyu Yue, Tat-Seng Chua

Figure 1 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Figure 2 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Figure 3 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Figure 4 for Rethinking the Two-Stage Framework for Grounded Situation Recognition
Viaarxiv icon

MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding

Add code
Bookmark button
Alert button
Aug 14, 2021
Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin

Figure 1 for MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Figure 2 for MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Figure 3 for MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Figure 4 for MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding
Viaarxiv icon

Vision Transformer with Progressive Sampling

Add code
Bookmark button
Alert button
Aug 03, 2021
Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip Torr, Wayne Zhang, Dahua Lin

Figure 1 for Vision Transformer with Progressive Sampling
Figure 2 for Vision Transformer with Progressive Sampling
Figure 3 for Vision Transformer with Progressive Sampling
Figure 4 for Vision Transformer with Progressive Sampling
Viaarxiv icon

Spatial Dual-Modality Graph Reasoning for Key Information Extraction

Add code
Bookmark button
Alert button
Mar 26, 2021
Hongbin Sun, Zhanghui Kuang, Xiaoyu Yue, Chenhao Lin, Wayne Zhang

Figure 1 for Spatial Dual-Modality Graph Reasoning for Key Information Extraction
Figure 2 for Spatial Dual-Modality Graph Reasoning for Key Information Extraction
Figure 3 for Spatial Dual-Modality Graph Reasoning for Key Information Extraction
Figure 4 for Spatial Dual-Modality Graph Reasoning for Key Information Extraction
Viaarxiv icon

HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation

Add code
Bookmark button
Alert button
Aug 12, 2020
Meng Wei, Chun Yuan, Xiaoyu Yue, Kuo Zhong

Figure 1 for HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation
Figure 2 for HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation
Figure 3 for HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation
Figure 4 for HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation
Viaarxiv icon

RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition

Add code
Bookmark button
Alert button
Jul 17, 2020
Xiaoyu Yue, Zhanghui Kuang, Chenhao Lin, Hongbin Sun, Wayne Zhang

Figure 1 for RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition
Figure 2 for RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition
Figure 3 for RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition
Figure 4 for RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition
Viaarxiv icon