Abstract: There is more to images than their objective physical content: for example, advertisements are created to persuade a viewer to take a certain action. We propose the novel problem of automatic advertisement understanding. To enable research on this problem, we create two datasets: an image dataset of 64,832 image ads, and a video dataset of 3,477 ads. Our data contains rich annotations encompassing the topic and sentiment of the ads, questions and answers describing what actions the viewer is prompted to take and the reasoning that the ad presents to persuade the viewer ("What should I do according to this ad, and why should I do it?"), and symbolic references ads make (e.g., a dove symbolizes peace). We also analyze the most common persuasive strategies ads use, and the capabilities that computer vision systems should have to understand these strategies. We present baseline classification results for several prediction tasks, including automatically answering questions about the messages of the ads.
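To make the annotation types above concrete, the sketch below shows what a single annotated ad record might look like. This is an illustration only: the field names, values, and JSON layout are assumptions made for exposition, not the released dataset's actual schema.

```python
# Hypothetical annotation record for one image ad; the field names and
# values are illustrative assumptions, NOT the dataset's real schema.
import json

record = json.loads("""
{
  "image_id": "ad_000001.jpg",
  "topic": "environment",
  "sentiment": "concerned",
  "qa_pairs": [
    {
      "question": "What should I do according to this ad, and why?",
      "answer": "I should recycle because waste destroys wildlife habitats."
    }
  ],
  "symbols": [
    {"region": [120, 45, 310, 260], "signifier": "dove", "signified": "peace"}
  ]
}
""")

# The action-reason answers pair a prompted action with the ad's argument.
for qa in record["qa_pairs"]:
    print(record["topic"], "->", qa["answer"])
```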
Abstract: In this technical report, we present our publicly downloadable implementation of the SALICON saliency model. At the time of this writing, SALICON is one of the top-performing saliency models on the MIT 300 fixation prediction dataset, which evaluates how well an algorithm can predict where humans would look in a given image. Recently, numerous models have achieved state-of-the-art performance on this benchmark, but none of the top 5 performing models (including SALICON) are available for download. To address this issue, we have created a publicly downloadable implementation of the SALICON model. It is our hope that our model will engender further research in visual attention modeling by providing a baseline for comparison of other algorithms and a platform for extending this implementation. The model we provide supports both training and testing, enabling researchers to quickly fine-tune it on their own datasets. We also provide a pre-trained model and code for users who only need to generate saliency maps for images without training their own model.
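For readers unfamiliar with fixation-prediction benchmarks such as MIT 300, the sketch below computes one commonly used saliency metric, Pearson's linear correlation coefficient (CC), between a predicted saliency map and a ground-truth fixation density map. It is a minimal illustration of the evaluation idea, not the benchmark's or the released implementation's code.

```python
# Minimal sketch of the CC metric used in fixation-prediction
# evaluation; for illustration only.
import numpy as np

def cc_score(saliency_map: np.ndarray, fixation_map: np.ndarray) -> float:
    """Pearson correlation between a predicted saliency map and a
    (blurred) ground-truth human fixation map of the same shape."""
    s = (saliency_map - saliency_map.mean()) / (saliency_map.std() + 1e-8)
    f = (fixation_map - fixation_map.mean()) / (fixation_map.std() + 1e-8)
    # Mean of the product of standardized maps equals their correlation.
    return float((s * f).mean())

# Toy usage with random maps; real maps come from the model and eye data.
rng = np.random.default_rng(0)
pred = rng.random((480, 640))
truth = rng.random((480, 640))
print(f"CC = {cc_score(pred, truth):.3f}")  # ~0 for unrelated maps
```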
Abstract: We introduce the novel problem of identifying the photographer behind a photograph. To explore whether current computer vision techniques can address this problem, we created a new dataset of over 180,000 images taken by 41 well-known photographers. Using this dataset, we examined the effectiveness of a variety of features (low- and high-level, including CNN features) at identifying the photographer. We also trained a new deep convolutional neural network for this task. Our results show that high-level features greatly outperform low-level features. We provide qualitative results using these learned models that give insight into our method's ability to distinguish between photographers, and allow us to draw interesting conclusions about what specific photographers shoot. We also demonstrate two applications of our method.
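As a rough illustration of the pipeline the abstract describes, the sketch below trains a multi-class classifier over per-image feature vectors with the 41 photographers as labels. The random features and the logistic-regression classifier are stand-ins for exposition; the paper's actual features (e.g., CNN activations) and models may differ.

```python
# Illustrative recipe: per-image features + a multi-class classifier
# over 41 photographer labels. Random vectors stand in for CNN features.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n_images, n_photographers, feat_dim = 2000, 41, 512

X = rng.standard_normal((n_images, feat_dim))        # stand-in features
y = rng.integers(0, n_photographers, size=n_images)  # photographer IDs

X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.2, random_state=0)

# A linear classifier on top of deep features is a common strong baseline.
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print(f"accuracy: {clf.score(X_te, y_te):.3f}")  # ~1/41 on random data
```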