Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling

Dec 03, 2020

Jing Su, Qingyun Dai, Frank Guerin, Mian Zhou

Figure 1 for BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling

Figure 2 for BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling

Figure 3 for BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling

Figure 4 for BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling

Share this with someone who'll enjoy it:

Abstract:Visual storytelling is a creative and challenging task, aiming to automatically generate a story-like description for a sequence of images. The descriptions generated by previous visual storytelling approaches lack coherence because they use word-level sequence generation methods and do not adequately consider sentence-level dependencies. To tackle this problem, we propose a novel hierarchical visual storytelling framework which separately models sentence-level and word-level semantics. We use the transformer-based BERT to obtain embeddings for sentences and words. We then employ a hierarchical LSTM network: the bottom LSTM receives as input the sentence vector representation from BERT, to learn the dependencies between the sentences corresponding to images, and the top LSTM is responsible for generating the corresponding word vector representations, taking input from the bottom LSTM. Experimental results demonstrate that our model outperforms most closely related baselines under automatic evaluation metrics BLEU and CIDEr, and also show the effectiveness of our method with human evaluation.

View paper on

Share this with someone who'll enjoy it:

Title:BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling

Paper and Code