Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Jun 02, 2016

Yu Liu, Jianlong Fu, Tao Mei, Chang Wen Chen

Figure 1 for Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Figure 2 for Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Figure 3 for Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Figure 4 for Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Share this with someone who'll enjoy it:

Abstract:Visual storytelling aims to generate human-level narrative language (i.e., a natural paragraph with multiple sentences) from a photo streams. A typical photo story consists of a global timeline with multi-thread local storylines, where each storyline occurs in one different scene. Such complex structure leads to large content gaps at scene transitions between consecutive photos. Most existing image/video captioning methods can only achieve limited performance, because the units in traditional recurrent neural networks (RNN) tend to "forget" the previous state when the visual sequence is inconsistent. In this paper, we propose a novel visual storytelling approach with Bidirectional Multi-thread Recurrent Neural Network (BMRNN). First, based on the mined local storylines, a skip gated recurrent unit (sGRU) with delay control is proposed to maintain longer range visual information. Second, by using sGRU as basic units, the BMRNN is trained to align the local storylines into the global sequential timeline. Third, a new training scheme with a storyline-constrained objective function is proposed by jointly considering both global and local matches. Experiments on three standard storytelling datasets show that the BMRNN model outperforms the state-of-the-art methods.

View paper on

Share this with someone who'll enjoy it:

Title:Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network

Paper and Code