Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aditya Khosla

ImageNet Large Scale Visual Recognition Challenge

Jan 30, 2015

Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein(+2 more)

Figure 1 for ImageNet Large Scale Visual Recognition Challenge

Figure 2 for ImageNet Large Scale Visual Recognition Challenge

Figure 3 for ImageNet Large Scale Visual Recognition Challenge

Figure 4 for ImageNet Large Scale Visual Recognition Challenge

Abstract:The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object category classification and detection on hundreds of object categories and millions of images. The challenge has been run annually from 2010 to present, attracting participation from more than fifty institutions. This paper describes the creation of this benchmark dataset and the advances in object recognition that have been possible as a result. We discuss the challenges of collecting large-scale ground truth annotation, highlight key breakthroughs in categorical object recognition, provide a detailed analysis of the current state of the field of large-scale image classification and object detection, and compare the state-of-the-art computer vision accuracy with human accuracy. We conclude with lessons learned in the five years of the challenge, and propose future directions and improvements.

* 43 pages, 16 figures. v3 includes additional comparisons with PASCAL VOC (per-category comparisons in Table 3, distribution of localization difficulty in Fig 16), a list of queries used for obtaining object detection images (Appendix C), and some additional references

Via

Access Paper or Ask Questions

Inverting and Visualizing Features for Object Detection

May 05, 2013

Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba

Figure 1 for Inverting and Visualizing Features for Object Detection

Figure 2 for Inverting and Visualizing Features for Object Detection

Figure 3 for Inverting and Visualizing Features for Object Detection

Figure 4 for Inverting and Visualizing Features for Object Detection

Abstract:We introduce algorithms to visualize feature spaces used by object detectors. The tools in this paper allow a human to put on `HOG goggles' and perceive the visual world as a HOG based object detector sees it. We found that these visualizations allow us to analyze object detection systems in new ways and gain new insight into the detector's failures. For example, when we visualize the features for high scoring false alarms, we discovered that, although they are clearly wrong in image space, they do look deceptively similar to true positives in feature space. This result suggests that many of these false alarms are caused by our choice of feature space, and indicates that creating a better learning algorithm or building bigger datasets is unlikely to correct these errors. By visualizing feature spaces, we can gain a more intuitive understanding of our detection systems.

* This paper is a preprint of our conference paper. We have made it available early in the hopes that others find it useful

Via

Access Paper or Ask Questions