Ynan Ding

REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

Mar 10, 2025
Via arXiv