Picture for Chongyang Li

Chongyang Li

F2RVLM: Boosting Fine-grained Fragment Retrieval for Multi-Modal Long-form Dialogue with Vision Language Model

Add code
Aug 25, 2025
Viaarxiv icon

ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation

Add code
Jul 03, 2025
Viaarxiv icon

Knowledge-Base based Semantic Image Transmission Using CLIP

Add code
Apr 01, 2025
Viaarxiv icon

Learning to Evaluate Performance of Multi-modal Semantic Localization

Add code
Sep 19, 2022
Figure 1 for Learning to Evaluate Performance of Multi-modal Semantic Localization
Figure 2 for Learning to Evaluate Performance of Multi-modal Semantic Localization
Figure 3 for Learning to Evaluate Performance of Multi-modal Semantic Localization
Figure 4 for Learning to Evaluate Performance of Multi-modal Semantic Localization
Viaarxiv icon