Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Robert Pless

Hotels-50K: A Global Hotel Recognition Dataset

Jan 26, 2019

Abby Stylianou, Hong Xuan, Maya Shende, Jonathan Brandt, Richard Souvenir, Robert Pless

Figure 1 for Hotels-50K: A Global Hotel Recognition Dataset

Figure 2 for Hotels-50K: A Global Hotel Recognition Dataset

Figure 3 for Hotels-50K: A Global Hotel Recognition Dataset

Figure 4 for Hotels-50K: A Global Hotel Recognition Dataset

Abstract:Recognizing a hotel from an image of a hotel room is important for human trafficking investigations. Images directly link victims to places and can help verify where victims have been trafficked, and where their traffickers might move them or others in the future. Recognizing the hotel from images is challenging because of low image quality, uncommon camera perspectives, large occlusions (often the victim), and the similarity of objects (e.g., furniture, art, bedding) across different hotel rooms. To support efforts towards this hotel recognition task, we have curated a dataset of over 1 million annotated hotel room images from 50,000 hotels. These images include professionally captured photographs from travel websites and crowd-sourced images from a mobile application, which are more similar to the types of images analyzed in real-world investigations. We present a baseline approach based on a standard network architecture and a collection of data-augmentation approaches tuned to this problem domain.

Via

Access Paper or Ask Questions

Visualizing Deep Similarity Networks

Jan 02, 2019

Abby Stylianou, Richard Souvenir, Robert Pless

Figure 1 for Visualizing Deep Similarity Networks

Figure 2 for Visualizing Deep Similarity Networks

Figure 3 for Visualizing Deep Similarity Networks

Figure 4 for Visualizing Deep Similarity Networks

Abstract:For convolutional neural network models that optimize an image embedding, we propose a method to highlight the regions of images that contribute most to pairwise similarity. This work is a corollary to the visualization tools developed for classification networks, but applicable to the problem domains better suited to similarity learning. The visualization shows how similarity networks that are fine-tuned learn to focus on different features. We also generalize our approach to embedding networks that use different pooling strategies and provide a simple mechanism to support image similarity searches on objects or sub-regions in the query image.

Via

Access Paper or Ask Questions

Deep Randomized Ensembles for Metric Learning

Sep 04, 2018

Hong Xuan, Richard Souvenir, Robert Pless

Figure 1 for Deep Randomized Ensembles for Metric Learning

Figure 2 for Deep Randomized Ensembles for Metric Learning

Figure 3 for Deep Randomized Ensembles for Metric Learning

Figure 4 for Deep Randomized Ensembles for Metric Learning

Abstract:Learning embedding functions, which map semantically related inputs to nearby locations in a feature space supports a variety of classification and information retrieval tasks. In this work, we propose a novel, generalizable and fast method to define a family of embedding functions that can be used as an ensemble to give improved results. Each embedding function is learned by randomly bagging the training labels into small subsets. We show experimentally that these embedding ensembles create effective embedding functions. The ensemble output defines a metric space that improves state of the art performance for image retrieval on CUB-200-2011, Cars-196, In-Shop Clothes Retrieval and VehicleID.

* ECCV 2018

Via

Access Paper or Ask Questions

Deep Feature Interpolation for Image Content Changes

Jun 19, 2017

Paul Upchurch, Jacob Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Weinberger

Figure 1 for Deep Feature Interpolation for Image Content Changes

Figure 2 for Deep Feature Interpolation for Image Content Changes

Figure 3 for Deep Feature Interpolation for Image Content Changes

Figure 4 for Deep Feature Interpolation for Image Content Changes

Abstract:We propose Deep Feature Interpolation (DFI), a new data-driven baseline for automatic high-resolution image transformation. As the name suggests, it relies only on simple linear interpolation of deep convolutional features from pre-trained convnets. We show that despite its simplicity, DFI can perform high-level semantic transformations like "make older/younger", "make bespectacled", "add smile", among others, surprisingly well - sometimes even matching or outperforming the state-of-the-art. This is particularly unexpected as DFI requires no specialized network architecture or even any deep network to be trained for these tasks. DFI therefore can be used as a new baseline to evaluate more complex algorithms and provides a practical answer to the question of which image transformation tasks are still challenging in the rise of deep learning.

* First two authors contributed equally. Accepted by CVPR 2017. Code at https://github.com/paulu/deepfeatinterp

Via

Access Paper or Ask Questions

Shadow Estimation Method for "The Episolar Constraint: Monocular Shape from Shadow Correspondence"

Apr 15, 2013

Austin Abrams, Chris Hawley, Kylia Miskell, Adina Stoica, Nathan Jacobs, Robert Pless

Figure 1 for Shadow Estimation Method for "The Episolar Constraint: Monocular Shape from Shadow Correspondence"

Figure 2 for Shadow Estimation Method for "The Episolar Constraint: Monocular Shape from Shadow Correspondence"

Figure 3 for Shadow Estimation Method for "The Episolar Constraint: Monocular Shape from Shadow Correspondence"

Figure 4 for Shadow Estimation Method for "The Episolar Constraint: Monocular Shape from Shadow Correspondence"

Abstract:Recovering shadows is an important step for many vision algorithms. Current approaches that work with time-lapse sequences are limited to simple thresholding heuristics. We show these approaches only work with very careful tuning of parameters, and do not work well for long-term time-lapse sequences taken over the span of many months. We introduce a parameter-free expectation maximization approach which simultaneously estimates shadows, albedo, surface normals, and skylight. This approach is more accurate than previous methods, works over both very short and very long sequences, and is robust to the effects of nonlinear camera response. Finally, we demonstrate that the shadow masks derived through this algorithm substantially improve the performance of sun-based photometric stereo compared to earlier shadow mask estimation.

Via

Access Paper or Ask Questions