Alert button
Picture for Xingjian He

Xingjian He

Alert button

Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation

Add code
Bookmark button
Alert button
Apr 12, 2024
Yichen Yan, Xingjian He, Sihan Chen, Jing Liu

Viaarxiv icon

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

Add code
Bookmark button
Alert button
Mar 20, 2024
Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu

Figure 1 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Figure 2 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Figure 3 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Figure 4 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Viaarxiv icon

Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions

Add code
Bookmark button
Alert button
Feb 17, 2024
Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu

Viaarxiv icon

Unveiling Parts Beyond Objects:Towards Finer-Granularity Referring Expression Segmentation

Add code
Bookmark button
Alert button
Dec 13, 2023
Wenxuan Wang, Tongtian Yue, Yisi Zhang, Longteng Guo, Xingjian He, Xinlong Wang, Jing Liu

Viaarxiv icon

EAVL: Explicitly Align Vision and Language for Referring Image Segmentation

Add code
Bookmark button
Alert button
Aug 22, 2023
Yichen Yan, Xingjian He, Wenxuan Wang, Sihan Chen, Jing Liu

Figure 1 for EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Figure 2 for EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Figure 3 for EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Figure 4 for EAVL: Explicitly Align Vision and Language for Referring Image Segmentation
Viaarxiv icon

COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Add code
Bookmark button
Alert button
Jun 15, 2023
Sihan Chen, Xingjian He, Handong Li, Xiaojie Jin, Jiashi Feng, Jing Liu

Figure 1 for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Figure 2 for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Figure 3 for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Figure 4 for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Viaarxiv icon

MMNet: Multi-Mask Network for Referring Image Segmentation

Add code
Bookmark button
Alert button
May 24, 2023
Yichen Yan, Xingjian He, Wenxuan Wan, Jing Liu

Figure 1 for MMNet: Multi-Mask Network for Referring Image Segmentation
Figure 2 for MMNet: Multi-Mask Network for Referring Image Segmentation
Figure 3 for MMNet: Multi-Mask Network for Referring Image Segmentation
Figure 4 for MMNet: Multi-Mask Network for Referring Image Segmentation
Viaarxiv icon

VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending

Add code
Bookmark button
Alert button
May 22, 2023
Xingjian He, Sihan Chen, Fan Ma, Zhicheng Huang, Xiaojie Jin, Zikang Liu, Dongmei Fu, Yi Yang, Jing Liu, Jiashi Feng

Figure 1 for VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending
Figure 2 for VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending
Figure 3 for VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending
Figure 4 for VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending
Viaarxiv icon

CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation

Add code
Bookmark button
Alert button
May 22, 2023
Wenxuan Wang, Jing Liu, Xingjian He, Yisi Zhang, Chen Chen, Jiachen Shen, Yan Zhang, Jiangyun Li

Figure 1 for CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Figure 2 for CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Figure 3 for CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Figure 4 for CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
Viaarxiv icon

Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner

Add code
Bookmark button
Alert button
May 19, 2023
Zikang Liu, Sihan Chen, Longteng Guo, Handong Li, Xingjian He, Jing Liu

Figure 1 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Figure 2 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Figure 3 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Figure 4 for Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Viaarxiv icon