Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Unpaired Image Captioning via Scene Graph Alignments

Apr 04, 2019

Jiuxiang Gu, Shafiq Joty, Jianfei Cai, Handong Zhao, Xu Yang, Gang Wang

Figure 1 for Unpaired Image Captioning via Scene Graph Alignments

Figure 2 for Unpaired Image Captioning via Scene Graph Alignments

Figure 3 for Unpaired Image Captioning via Scene Graph Alignments

Figure 4 for Unpaired Image Captioning via Scene Graph Alignments

Share this with someone who'll enjoy it:

Abstract:Most of the existing deep learning based image captioning methods are fully-supervised models, which require large-scale paired image-caption datasets. However, getting large scale image-caption paired data is labor-intensive and time-consuming. In this paper, we present a scene graph based approach for unpaired image captioning. Our framework comprises an image scene graph generator, a sentence scene graph generator, a scene graph encoder, and a sentence decoder. Specifically, we first train the scene graph encoder and the sentence decoder on the text modality. To align the scene graphs between images and sentences, we propose an unsupervised feature alignment method that maps the scene graph features from the image modality to the sentence modality without any paired data. Experimental results show that our proposed model can generate quite promising results without using any image-caption training pairs, outperforming existing methods by a wide margin.

View paper on

Share this with someone who'll enjoy it:

Title:Unpaired Image Captioning via Scene Graph Alignments

Paper and Code