Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vinay P. Namboodiri

Looking back at Labels: A Class based Domain Adaptation Technique

Apr 02, 2019
Vinod Kumar Kurmi, Vinay P. Namboodiri

Figure 1 for Looking back at Labels: A Class based Domain Adaptation Technique

Figure 2 for Looking back at Labels: A Class based Domain Adaptation Technique

Figure 3 for Looking back at Labels: A Class based Domain Adaptation Technique

Figure 4 for Looking back at Labels: A Class based Domain Adaptation Technique

In this paper, we solve the problem of adapting classifiers across domains. We consider the problem of domain adaptation for multi-class classification where we are provided a labeled set of examples in a source dataset and we are provided a target dataset with no supervision. In this setting, we propose an adversarial discriminator based approach. While the approach based on adversarial discriminator has been previously proposed; in this paper, we present an informed adversarial discriminator. Our observation relies on the analysis that shows that if the discriminator has access to all the information available including the class structure present in the source dataset, then it can guide the transformation of features of the target set of classes to a more structure adapted space. Using this formulation, we obtain state-of-the-art results for the standard evaluation on benchmark datasets. We further provide detailed analysis which shows that using all the labeled information results in an improved domain adaptation.

* IJCNN 2019 Accepted

Via

Access Paper or Ask Questions

HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

Mar 25, 2019
Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri

Figure 1 for HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

Figure 2 for HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

Figure 3 for HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

Figure 4 for HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs

We present a novel deep learning architecture in which the convolution operation leverages heterogeneous kernels. The proposed HetConv (Heterogeneous Kernel-Based Convolution) reduces the computation (FLOPs) and the number of parameters as compared to standard convolution operation while still maintaining representational efficiency. To show the effectiveness of our proposed convolution, we present extensive experimental results on the standard convolutional neural network (CNN) architectures such as VGG \cite{vgg2014very} and ResNet \cite{resnet}. We find that after replacing the standard convolutional filters in these architectures with our proposed HetConv filters, we achieve 3X to 8X FLOPs based improvement in speed while still maintaining (and sometimes improving) the accuracy. We also compare our proposed convolutions with group/depth wise convolutions and show that it achieves more FLOPs reduction with significantly higher accuracy.

* Accepted in CVPR 2019

Via

Access Paper or Ask Questions

CVIT-MT Systems for WAT-2018

Mar 19, 2019
Jerin Philip, Vinay P. Namboodiri, C. V. Jawahar

Figure 1 for CVIT-MT Systems for WAT-2018

Figure 2 for CVIT-MT Systems for WAT-2018

Figure 3 for CVIT-MT Systems for WAT-2018

This document describes the machine translation system used in the submissions of IIIT-Hyderabad CVIT-MT for the WAT-2018 English-Hindi translation task. Performance is evaluated on the associated corpus provided by the organizers. We experimented with convolutional sequence to sequence architectures. We also train with additional data obtained through backtranslation.

Via

Access Paper or Ask Questions

Accuracy Booster: Performance Boosting using Feature Map Re-calibration

Mar 11, 2019
Pravendra Singh, Pratik Mazumder, Vinay P. Namboodiri

Figure 1 for Accuracy Booster: Performance Boosting using Feature Map Re-calibration

Figure 2 for Accuracy Booster: Performance Boosting using Feature Map Re-calibration

Figure 3 for Accuracy Booster: Performance Boosting using Feature Map Re-calibration

Figure 4 for Accuracy Booster: Performance Boosting using Feature Map Re-calibration

Convolution Neural Networks (CNN) have been extremely successful in solving intensive computer vision tasks. The convolutional filters used in CNNs have played a major role in this success, by extracting useful features from the inputs. Recently researchers have tried to boost the performance of CNNs by re-calibrating the feature maps produced by these filters, e.g., Squeeze-and-Excitation Networks (SENets). These approaches have achieved better performance by \textit{Exciting} up the important channels or feature maps while diminishing the rest. However, in the process, architectural complexity has increased. We propose an architectural block that introduces much lower complexity than the existing methods of CNN performance boosting while performing significantly better than them. We carry out experiments on the CIFAR, ImageNet and MS-COCO datasets, and show that the proposed block can challenge the state-of-the-art results. Our method boosts the ResNet-50 architecture to perform comparably to the ResNet-152 architecture, which is a three times deeper network, on classification. We also show experimentally that our method is not limited to classification but also generalizes well to other tasks such as object detection.

Via

Access Paper or Ask Questions

PUTWorkbench: Analysing Privacy in AI-intensive Systems

Feb 05, 2019
Saurabh Srivastava, Vinay P. Namboodiri, T. V. Prabhakar

Figure 1 for PUTWorkbench: Analysing Privacy in AI-intensive Systems

Figure 2 for PUTWorkbench: Analysing Privacy in AI-intensive Systems

Figure 3 for PUTWorkbench: Analysing Privacy in AI-intensive Systems

Figure 4 for PUTWorkbench: Analysing Privacy in AI-intensive Systems

AI intensive systems that operate upon user data face the challenge of balancing data utility with privacy concerns. We propose the idea and present the prototype of an open-source tool called Privacy Utility Trade-off (PUT) Workbench which seeks to aid software practitioners to take such crucial decisions. We pick a simple privacy model that doesn't require any background knowledge in Data Science and show how even that can achieve significant results over standard and real-life datasets. The tool and the source code is made freely available for extensions and usage.

Via

Access Paper or Ask Questions

Leveraging Filter Correlations for Deep Model Compression

Nov 26, 2018
Pravendra Singh, Vinay Kumar Verma, Piyush Rai, Vinay P. Namboodiri

Figure 1 for Leveraging Filter Correlations for Deep Model Compression

Figure 2 for Leveraging Filter Correlations for Deep Model Compression

Figure 3 for Leveraging Filter Correlations for Deep Model Compression

Figure 4 for Leveraging Filter Correlations for Deep Model Compression

We present a filter correlation based model compression approach for deep convolutional neural networks. Our approach iteratively identifies pairs of filters with largest pairwise correlations and discards one of the filters from each such pair. However, instead of discarding one of the filter from such pairs na\"{i}vely, we further optimize the model so that the two filters from each such pair are as highly correlated as possible so that discarding one of the filters from the pairs results in as little information loss as possible. After discarding the filters in each round, we further finetune the model to recover from the potential small loss incurred by the compression. We evaluate our proposed approach using a comprehensive set of experiments and ablation studies. Our compression method yields state-of-the-art FLOPs compression rates on various benchmarks, such as LeNet-5, VGG-16, and ResNet-50,56, which are still achieving excellent predictive performance for tasks such as object detection on benchmark datasets.

* 10 pages

Via

Access Paper or Ask Questions

Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

Nov 20, 2018
Pravendra Singh, Manikandan R, Neeraj Matiyali, Vinay P. Namboodiri

Figure 1 for Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

Figure 2 for Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

Figure 3 for Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

Figure 4 for Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

We propose a framework for compressing state-of-the-art Single Shot MultiBox Detector (SSD). The framework addresses compression in the following stages: Sparsity Induction, Filter Selection, and Filter Pruning. In the Sparsity Induction stage, the object detector model is sparsified via an improved global threshold. In Filter Selection & Pruning stage, we select and remove filters using sparsity statistics of filter weights in two consecutive convolutional layers. This results in the model with the size smaller than most existing compact architectures. We evaluate the performance of our framework with multiple datasets and compare over multiple methods. Experimental results show that our method achieves state-of-the-art compression of 6.7X and 4.9X on PASCAL VOC dataset on models SSD300 and SSD512 respectively. We further show that the method produces maximum compression of 26X with SSD512 on German Traffic Sign Detection Benchmark (GTSDB). Additionally, we also empirically show our method's adaptability for classification based architecture VGG16 on datasets CIFAR and German Traffic Sign Recognition Benchmark (GTSRB) achieving a compression rate of 125X and 200X with the reduction in flops by 90.50% and 96.6% respectively with no loss of accuracy. In addition to this, our method does not require any special libraries or hardware support for the resulting compressed models.

* IEEE Winter Conference on Applications of Computer Vision (WACV), 2019

Via

Access Paper or Ask Questions

Stability Based Filter Pruning for Accelerating Deep CNNs

Nov 20, 2018
Pravendra Singh, Vinay Sameer Raja Kadi, Nikhil Verma, Vinay P. Namboodiri

Figure 1 for Stability Based Filter Pruning for Accelerating Deep CNNs

Figure 2 for Stability Based Filter Pruning for Accelerating Deep CNNs

Figure 3 for Stability Based Filter Pruning for Accelerating Deep CNNs

Figure 4 for Stability Based Filter Pruning for Accelerating Deep CNNs

Convolutional neural networks (CNN) have achieved impressive performance on the wide variety of tasks (classification, detection, etc.) across multiple domains at the cost of high computational and memory requirements. Thus, leveraging CNNs for real-time applications necessitates model compression approaches that not only reduce the total number of parameters but reduce the overall computation as well. In this work, we present a stability-based approach for filter-level pruning of CNNs. We evaluate our proposed approach on different architectures (LeNet, VGG-16, ResNet, and Faster RCNN) and datasets and demonstrate its generalizability through extensive experiments. Moreover, our compressed models can be used at run-time without requiring any special libraries or hardware. Our model compression method reduces the number of FLOPS by an impressive factor of 6.03X and GPU memory footprint by more than 17X, significantly outperforming other state-of-the-art filter pruning methods.

* IEEE Winter Conference on Applications of Computer Vision (WACV), 2019

Via

Access Paper or Ask Questions

Multimodal Differential Network for Visual Question Generation

Aug 12, 2018
Badri N. Patro, Sandeep Kumar, Vinod K. Kurmi, Vinay P. Namboodiri

Figure 1 for Multimodal Differential Network for Visual Question Generation

Figure 2 for Multimodal Differential Network for Visual Question Generation

Figure 3 for Multimodal Differential Network for Visual Question Generation

Figure 4 for Multimodal Differential Network for Visual Question Generation

Generating natural questions from an image is a semantic task that requires using visual and language modality to learn multimodal representations. Images can have multiple visual and language contexts that are relevant for generating questions namely places, captions, and tags. In this paper, we propose the use of exemplars for obtaining the relevant context. We obtain this by using a Multimodal Differential Network to produce natural and engaging questions. The generated questions show a remarkable similarity to the natural questions as validated by a human study. Further, we observe that the proposed approach substantially improves over state-of-the-art benchmarks on the quantitative metrics (BLEU, METEOR, ROUGE, and CIDEr).

* EMNLP 2018 (accepted)

Via

Access Paper or Ask Questions

Learning Semantic Sentence Embeddings using Pair-wise Discriminator

Jul 02, 2018
Badri N. Patro, Vinod K. Kurmi, Sandeep Kumar, Vinay P. Namboodiri

Figure 1 for Learning Semantic Sentence Embeddings using Pair-wise Discriminator

Figure 2 for Learning Semantic Sentence Embeddings using Pair-wise Discriminator

Figure 3 for Learning Semantic Sentence Embeddings using Pair-wise Discriminator

Figure 4 for Learning Semantic Sentence Embeddings using Pair-wise Discriminator

In this paper, we propose a method for obtaining sentence-level embeddings. While the problem of securing word-level embeddings is very well studied, we propose a novel method for obtaining sentence-level embeddings. This is obtained by a simple method in the context of solving the paraphrase generation task. If we use a sequential encoder-decoder model for generating paraphrase, we would like the generated paraphrase to be semantically close to the original sentence. One way to ensure this is by adding constraints for true paraphrase embeddings to be close and unrelated paraphrase candidate sentence embeddings to be far. This is ensured by using a sequential pair-wise discriminator that shares weights with the encoder that is trained with a suitable loss function. Our loss function penalizes paraphrase sentence embedding distances from being too large. This loss is used in combination with a sequential encoder-decoder network. We also validated our method by evaluating the obtained embeddings for a sentiment analysis task. The proposed method results in semantic embeddings and outperforms the state-of-the-art on the paraphrase generation and sentiment analysis task on standard datasets. These results are also shown to be statistically significant.

* COLING 2018 (accepted)

Via

Access Paper or Ask Questions