Alert button
Picture for Xiaosong Zhang

Xiaosong Zhang

Alert button

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Feb 06, 2024
Quan Sun, Jinsheng Wang, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Xinlong Wang

Viaarxiv icon

Generative Multimodal Models are In-Context Learners

Dec 20, 2023
Quan Sun, Yufeng Cui, Xiaosong Zhang, Fan Zhang, Qiying Yu, Zhengxiong Luo, Yueze Wang, Yongming Rao, Jingjing Liu, Tiejun Huang, Xinlong Wang

Viaarxiv icon

CapsFusion: Rethinking Image-Text Data at Scale

Nov 02, 2023
Qiying Yu, Quan Sun, Xiaosong Zhang, Yufeng Cui, Fan Zhang, Yue Cao, Xinlong Wang, Jingjing Liu

Viaarxiv icon

Generative Pretraining in Multimodality

Jul 11, 2023
Quan Sun, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Yueze Wang, Hongcheng Gao, Jingjing Liu, Tiejun Huang, Xinlong Wang

Figure 1 for Generative Pretraining in Multimodality
Figure 2 for Generative Pretraining in Multimodality
Figure 3 for Generative Pretraining in Multimodality
Figure 4 for Generative Pretraining in Multimodality
Viaarxiv icon

SegGPT: Segmenting Everything In Context

Apr 06, 2023
Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang

Figure 1 for SegGPT: Segmenting Everything In Context
Figure 2 for SegGPT: Segmenting Everything In Context
Figure 3 for SegGPT: Segmenting Everything In Context
Figure 4 for SegGPT: Segmenting Everything In Context
Viaarxiv icon

HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling

May 30, 2022
Xiaosong Zhang, Yunjie Tian, Wei Huang, Qixiang Ye, Qi Dai, Lingxi Xie, Qi Tian

Figure 1 for HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Figure 2 for HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Figure 3 for HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Figure 4 for HiViT: Hierarchical Vision Transformer Meets Masked Image Modeling
Viaarxiv icon

Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection

May 19, 2022
Xiaosong Zhang, Feng Liu, Zhiliang Peng, Zonghao Guo, Fang Wan, Xiangyang Ji, Qixiang Ye

Figure 1 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Figure 2 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Figure 3 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Figure 4 for Integral Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Viaarxiv icon

Long-tailed Distribution Adaptation

Oct 06, 2021
Zhiliang Peng, Wei Huang, Zonghao Guo, Xiaosong Zhang, Jianbin Jiao, Qixiang Ye

Figure 1 for Long-tailed Distribution Adaptation
Figure 2 for Long-tailed Distribution Adaptation
Figure 3 for Long-tailed Distribution Adaptation
Figure 4 for Long-tailed Distribution Adaptation
Viaarxiv icon

FreeAnchor: Learning to Match Anchors for Visual Object Detection

Sep 05, 2019
Xiaosong Zhang, Fang Wan, Chang Liu, Rongrong Ji, Qixiang Ye

Figure 1 for FreeAnchor: Learning to Match Anchors for Visual Object Detection
Figure 2 for FreeAnchor: Learning to Match Anchors for Visual Object Detection
Figure 3 for FreeAnchor: Learning to Match Anchors for Visual Object Detection
Figure 4 for FreeAnchor: Learning to Match Anchors for Visual Object Detection
Viaarxiv icon

Adversarial Samples on Android Malware Detection Systems for IoT Systems

Feb 12, 2019
Xiaolei Liu, Xiaojiang Du, Xiaosong Zhang, Qingxin Zhu, Mohsen Guizani

Figure 1 for Adversarial Samples on Android Malware Detection Systems for IoT Systems
Figure 2 for Adversarial Samples on Android Malware Detection Systems for IoT Systems
Figure 3 for Adversarial Samples on Android Malware Detection Systems for IoT Systems
Figure 4 for Adversarial Samples on Android Malware Detection Systems for IoT Systems
Viaarxiv icon