Alert button
Picture for Xingyi Zhou

Xingyi Zhou

Alert button

Streaming Dense Video Captioning

Add code
Bookmark button
Alert button
Apr 01, 2024
Xingyi Zhou, Anurag Arnab, Shyamal Buch, Shen Yan, Austin Myers, Xuehan Xiong, Arsha Nagrani, Cordelia Schmid

Viaarxiv icon

Distilling Vision-Language Models on Millions of Videos

Add code
Bookmark button
Alert button
Jan 11, 2024
Yue Zhao, Long Zhao, Xingyi Zhou, Jialin Wu, Chun-Te Chu, Hui Miao, Florian Schroff, Hartwig Adam, Ting Liu, Boqing Gong, Philipp Krähenbühl, Liangzhe Yuan

Viaarxiv icon

Pixel Aligned Language Models

Add code
Bookmark button
Alert button
Dec 14, 2023
Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid

Viaarxiv icon

MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation

Add code
Bookmark button
Alert button
Dec 11, 2023
Abdullah Rashwan, Jiageng Zhang, Ali Taalimi, Fan Yang, Xingyi Zhou, Chaochao Yan, Liang-Chieh Chen, Yeqing Li

Figure 1 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 2 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 3 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 4 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Viaarxiv icon

Does Visual Pretraining Help End-to-End Reasoning?

Add code
Bookmark button
Alert button
Jul 17, 2023
Chen Sun, Calvin Luo, Xingyi Zhou, Anurag Arnab, Cordelia Schmid

Figure 1 for Does Visual Pretraining Help End-to-End Reasoning?
Figure 2 for Does Visual Pretraining Help End-to-End Reasoning?
Figure 3 for Does Visual Pretraining Help End-to-End Reasoning?
Figure 4 for Does Visual Pretraining Help End-to-End Reasoning?
Viaarxiv icon

Dense Video Object Captioning from Disjoint Supervision

Add code
Bookmark button
Alert button
Jun 20, 2023
Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid

Figure 1 for Dense Video Object Captioning from Disjoint Supervision
Figure 2 for Dense Video Object Captioning from Disjoint Supervision
Figure 3 for Dense Video Object Captioning from Disjoint Supervision
Figure 4 for Dense Video Object Captioning from Disjoint Supervision
Viaarxiv icon

How can objects help action recognition?

Add code
Bookmark button
Alert button
Jun 20, 2023
Xingyi Zhou, Anurag Arnab, Chen Sun, Cordelia Schmid

Figure 1 for How can objects help action recognition?
Figure 2 for How can objects help action recognition?
Figure 3 for How can objects help action recognition?
Figure 4 for How can objects help action recognition?
Viaarxiv icon

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Add code
Bookmark button
Alert button
Jun 02, 2023
Xiuye Gu, Yin Cui, Jonathan Huang, Abdullah Rashwan, Xuan Yang, Xingyi Zhou, Golnaz Ghiasi, Weicheng Kuo, Huizhong Chen, Liang-Chieh Chen, David A Ross

Figure 1 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 2 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 3 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 4 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Viaarxiv icon

NMS Strikes Back

Add code
Bookmark button
Alert button
Dec 12, 2022
Jeffrey Ouyang-Zhang, Jang Hyun Cho, Xingyi Zhou, Philipp Krähenbühl

Figure 1 for NMS Strikes Back
Figure 2 for NMS Strikes Back
Figure 3 for NMS Strikes Back
Figure 4 for NMS Strikes Back
Viaarxiv icon

Global Tracking Transformers

Add code
Bookmark button
Alert button
Mar 24, 2022
Xingyi Zhou, Tianwei Yin, Vladlen Koltun, Phillip Krähenbühl

Figure 1 for Global Tracking Transformers
Figure 2 for Global Tracking Transformers
Figure 3 for Global Tracking Transformers
Figure 4 for Global Tracking Transformers
Viaarxiv icon