Xiyang Dai

Rewrite the Stars

Mar 29, 2024
Xu Ma, Xiyang Dai, Yue Bai, Yizhou Wang, Yun Fu

Efficient Modulation for Vision Networks

Mar 29, 2024
Xu Ma, Xiyang Dai, Jianwei Yang, Bin Xiao, Yinpeng Chen, Yun Fu, Lu Yuan

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Mar 15, 2024
Hongyuan Yu, Cheng Wan, Mengchen Liu, Dongdong Chen, Bin Xiao, Xiyang Dai

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Nov 10, 2023
Bin Xiao, Haiping Wu, Weijian Xu, Xiyang Dai, Houdong Hu, Yumao Lu, Michael Zeng, Ce Liu, Lu Yuan

On the Hidden Waves of Image

Oct 19, 2023
Yinpeng Chen, Dongdong Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin

LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following

Oct 18, 2023
Cheng-Fu Yang, Yen-Chun Chen, Jianwei Yang, Xiyang Dai, Lu Yuan, Yu-Chiang Frank Wang, Kai-Wei Chang

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection

Oct 18, 2023
Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Image is First-order Norm+Linear Autoregressive

May 25, 2023
Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin

ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System

Apr 29, 2023
Junke Wang, Dongdong Chen, Chong Luo, Xiyang Dai, Lu Yuan, Zuxuan Wu, Yu-Gang Jiang
