Alex Jinpeng Wang

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Jan 01, 2024
Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou

Parrot Captions Teach CLIP to Spot Text

Dec 28, 2023
Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou

UniVTG: Towards Unified Video-Language Temporal Grounding

Aug 18, 2023
Kevin Qinghong Lin, Pengchuan Zhang, Joya Chen, Shraman Pramanick, Difei Gao, Alex Jinpeng Wang, Rui Yan, Mike Zheng Shou

Too Large; Data Reduction for Vision-Language Pre-Training

Jun 01, 2023
Alex Jinpeng Wang, Kevin Qinghong Lin, David Junhao Zhang, Stan Weixian Lei, Mike Zheng Shou

Position-guided Text Prompt for Vision-Language Pre-training

Dec 19, 2022
Alex Jinpeng Wang, Pan Zhou, Mike Zheng Shou, Shuicheng Yan

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

Jul 04, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

Jul 04, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Rui Yan, Eric Zhongcong Xu, Rongcheng Tu, Yanru Zhu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Wei Liu, Mike Zheng Shou

Egocentric Video-Language Pretraining

Jun 03, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou
