Xiaojie Jin

Video Recognition in Portrait Mode

Dec 21, 2023
Mingfei Han, Linjie Yang, Xiaojie Jin, Jiashi Feng, Xiaojun Chang, Heng Wang

Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens

Dec 12, 2023
Fan Ma, Xiaojie Jin, Heng Wang, Yuchen Xian, Jiashi Feng, Yi Yang

PixelLM: Pixel Reasoning with Large Multimodal Model

Dec 04, 2023
Zhongwei Ren, Zhicheng Huang, Yunchao Wei, Yao Zhao, Dongmei Fu, Jiashi Feng, Xiaojie Jin

Selective Feature Adapter for Dense Vision Transformers

Oct 03, 2023
Xueqing Deng, Qi Fan, Xiaojie Jin, Linjie Yang, Peng Wang

Realistic Full-Body Tracking from Sparse Observations via Joint-Level Modeling

Aug 17, 2023
Xiaozheng Zheng, Zhuo Su, Chao Wen, Zhou Xue, Xiaojie Jin

COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Jun 15, 2023
Sihan Chen, Xingjian He, Handong Li, Xiaojie Jin, Jiashi Feng, Jing Liu

Delving Deeper into Data Scaling in Masked Image Modeling

May 24, 2023
Cheng-Ze Lu, Xiaojie Jin, Qibin Hou, Jun Hao Liew, Ming-Ming Cheng, Jiashi Feng

VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending

May 22, 2023
Xingjian He, Sihan Chen, Fan Ma, Zhicheng Huang, Xiaojie Jin, Zikang Liu, Dongmei Fu, Yi Yang, Jing Liu, Jiashi Feng

Multimodal Video Adapter for Parameter Efficient Video Text Retrieval

Jan 19, 2023
Bowen Zhang, Xiaojie Jin, Weibo Gong, Kai Xu, Zhao Zhang, Peng Wang, Xiaohui Shen, Jiashi Feng

Temporal Perceiving Video-Language Pre-training

Jan 18, 2023
Fan Ma, Xiaojie Jin, Heng Wang, Jingjia Huang, Linchao Zhu, Jiashi Feng, Yi Yang
