Semi-Supervised Learning for Image Captioning


Describe Anything: Detailed Localized Image and Video Captioning

Apr 22, 2025
Via arXiv

Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model

Jan 03, 2025
Via arXiv

Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval

Apr 23, 2024
Via arXiv

Semi-Supervised Image Captioning Considering Wasserstein Graph Matching

Mar 26, 2024
Via arXiv

Semi-supervised Text-based Person Search

Apr 28, 2024
Via arXiv

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Jul 10, 2024
Via arXiv

NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models

May 31, 2024
Via arXiv

Cycle-Consistency Learning for Captioning and Grounding

Dec 23, 2023
Via arXiv

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Nov 27, 2023
Via arXiv

Semi-Supervised Image Captioning with CLIP

Jun 26, 2023
Via arXiv