Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Abhishek Das

Grad-CAM: Why did you say that?

Jan 25, 2017

Ramprasaath R Selvaraju, Abhishek Das, Ramakrishna Vedantam, Michael Cogswell, Devi Parikh, Dhruv Batra

Figure 1 for Grad-CAM: Why did you say that?

Figure 2 for Grad-CAM: Why did you say that?

Figure 3 for Grad-CAM: Why did you say that?

Figure 4 for Grad-CAM: Why did you say that?

Abstract:We propose a technique for making Convolutional Neural Network (CNN)-based models more transparent by visualizing input regions that are 'important' for predictions -- or visual explanations. Our approach, called Gradient-weighted Class Activation Mapping (Grad-CAM), uses class-specific gradient information to localize important regions. These localizations are combined with existing pixel-space visualizations to create a novel high-resolution and class-discriminative visualization called Guided Grad-CAM. These methods help better understand CNN-based models, including image captioning and visual question answering (VQA) models. We evaluate our visual explanations by measuring their ability to discriminate between classes, to inspire trust in humans, and their correlation with occlusion maps. Grad-CAM provides a new way to understand CNN-based models. We have released code, an online demo hosted on CloudCV, and a full version of this extended abstract.

* Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems. This is an extended abstract version of arXiv:1610.02391 (CVPR format)

Via

Access Paper or Ask Questions

Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Jun 17, 2016

Abhishek Das, Harsh Agrawal, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra

Figure 1 for Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Figure 2 for Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Figure 3 for Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Figure 4 for Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?

Abstract:We conduct large-scale studies on `human attention' in Visual Question Answering (VQA) to understand where humans choose to look to answer questions about images. We design and test multiple game-inspired novel attention-annotation interfaces that require the subject to sharpen regions of a blurred image to answer a question. Thus, we introduce the VQA-HAT (Human ATtention) dataset. We evaluate attention maps generated by state-of-the-art VQA models against human attention both qualitatively (via visualizations) and quantitatively (via rank-order correlation). Overall, our experiments show that current attention models in VQA do not seem to be looking at the same regions as humans.

* 5 pages, 4 figures, 3 tables, presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY. arXiv admin note: substantial text overlap with arXiv:1606.03556

Via

Access Paper or Ask Questions

Elimination of Specular reflection and Identification of ROI: The First Step in Automated Detection of Cervical Cancer using Digital Colposcopy

Aug 11, 2011

Abhishek Das, Avijit Kar, Debasis Bhattacharyya

Abstract:Cervical Cancer is one of the most common forms of cancer in women worldwide. Most cases of cervical cancer can be prevented through screening programs aimed at detecting precancerous lesions. During Digital Colposcopy, Specular Reflections (SR) appear as bright spots heavily saturated with white light. These occur due to the presence of moisture on the uneven cervix surface, which act like mirrors reflecting light from the illumination source. Apart from camouflaging the actual features, the SR also affects subsequent segmentation routines and hence must be removed. Our novel technique eliminates the SR and makes the colposcopic images (cervigram) ready for segmentation algorithms. The cervix region occupies about half of the cervigram image. Other parts of the image contain irrelevant information, such as equipment, frames, text and non-cervix tissues. This irrelevant information can confuse automatic identification of the tissues within the cervix. The first step is, therefore, focusing on the cervical borders, so that we have a geometric boundary on the relevant image area. We have proposed a type of modified kmeans clustering algorithm to evaluate the region of interest.

* http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5962218, 2011
* IEEE Imaging Systems and Techniques, 2011, Print ISBN: 978-1-61284-894-5, pages 237 - 241

Via

Access Paper or Ask Questions

Preprocessing for Automating Early Detection of Cervical Cancer

Aug 11, 2011

Abhishek Das, Avijit Kar, Debasis Bhattacharyya

Abstract:Uterine Cervical Cancer is one of the most common forms of cancer in women worldwide. Most cases of cervical cancer can be prevented through screening programs aimed at detecting precancerous lesions. During Digital Colposcopy, colposcopic images or cervigrams are acquired in raw form. They contain specular reflections which appear as bright spots heavily saturated with white light and occur due to the presence of moisture on the uneven cervix surface and. The cervix region occupies about half of the raw cervigram image. Other parts of the image contain irrelevant information, such as equipment, frames, text and non-cervix tissues. This irrelevant information can confuse automatic identification of the tissues within the cervix. Therefore we focus on the cervical borders, so that we have a geometric boundary on the relevant image area. Our novel technique eliminates the SR, identifies the region of interest and makes the cervigram ready for segmentation algorithms.

* 15th International Conference on Information Visualisation (Track: 8th International Conference BioMedical Visualization) at London, UK (IEEE Computer Society)

Via

Access Paper or Ask Questions

Preprocessing: A Step in Automating Early Detection of Cervical Cancer

Aug 11, 2011

Abhishek Das, Avijit Kar, Debasis Bhattacharyya

Abstract:This paper has been withdrawn

* wrong conference name mentioned (This paper has been withdrawn)

Via

Access Paper or Ask Questions