Xiyang Dai

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Mar 15, 2024
Hongyuan Yu, Cheng Wan, Mengchen Liu, Dongdong Chen, Bin Xiao, Xiyang Dai

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Nov 10, 2023
Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan

On the Hidden Waves of Image

Oct 19, 2023
Yinpeng Chen, Dongdong Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin

LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following

Oct 18, 2023
Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection

Oct 18, 2023
Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Image is First-order Norm+Linear Autoregressive

May 25, 2023
Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin

ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System

Apr 29, 2023
Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang

OmniTracker: Unifying Object Tracking by Tracking-with-Detection

Mar 21, 2023
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang

Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations

Feb 27, 2023
Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang
