Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mayank Vatsa

Class Equilibrium using Coulomb's Law

Apr 25, 2021

Saheb Chhabra, Puspita Majumdar, Mayank Vatsa, Richa Singh

Figure 1 for Class Equilibrium using Coulomb's Law

Figure 2 for Class Equilibrium using Coulomb's Law

Figure 3 for Class Equilibrium using Coulomb's Law

Figure 4 for Class Equilibrium using Coulomb's Law

Abstract:Projection algorithms learn a transformation function to project the data from input space to the feature space, with the objective of increasing the inter-class distance. However, increasing the inter-class distance can affect the intra-class distance. Maintaining an optimal inter-class separation among the classes without affecting the intra-class distance of the data distribution is a challenging task. In this paper, inspired by the Coulomb's law of Electrostatics, we propose a new algorithm to compute the equilibrium space of any data distribution where the separation among the classes is optimal. The algorithm further learns the transformation between the input space and equilibrium space to perform classification in the equilibrium space. The performance of the proposed algorithm is evaluated on four publicly available datasets at three different resolutions. It is observed that the proposed algorithm performs well for low-resolution images.

* Accepted at IJCNN 2021

Via

Access Paper or Ask Questions

Age Gap Reducer-GAN for Recognizing Age-Separated Faces

Nov 11, 2020

Daksha Yadav, Naman Kohli, Mayank Vatsa, Richa Singh, Afzel Noore

Figure 1 for Age Gap Reducer-GAN for Recognizing Age-Separated Faces

Figure 2 for Age Gap Reducer-GAN for Recognizing Age-Separated Faces

Figure 3 for Age Gap Reducer-GAN for Recognizing Age-Separated Faces

Figure 4 for Age Gap Reducer-GAN for Recognizing Age-Separated Faces

Abstract:In this paper, we propose a novel algorithm for matching faces with temporal variations caused due to age progression. The proposed generative adversarial network algorithm is a unified framework that combines facial age estimation and age-separated face verification. The key idea of this approach is to learn the age variations across time by conditioning the input image on the subject's gender and the target age group to which the face needs to be progressed. The loss function accounts for reducing the age gap between the original image and generated face image as well as preserving the identity. Both visual fidelity and quantitative evaluations demonstrate the efficacy of the proposed architecture on different facial age databases for age-separated face recognition.

Via

Access Paper or Ask Questions

Trustworthy AI

Nov 02, 2020

Richa Singh, Mayank Vatsa, Nalini Ratha

Abstract:Modern AI systems are reaping the advantage of novel learning methods. With their increasing usage, we are realizing the limitations and shortfalls of these systems. Brittleness to minor adversarial changes in the input data, ability to explain the decisions, address the bias in their training data, high opacity in terms of revealing the lineage of the system, how they were trained and tested, and under which parameters and conditions they can reliably guarantee a certain level of performance, are some of the most prominent limitations. Ensuring the privacy and security of the data, assigning appropriate credits to data sources, and delivering decent outputs are also required features of an AI system. We propose the tutorial on Trustworthy AI to address six critical issues in enhancing user and public trust in AI systems, namely: (i) bias and fairness, (ii) explainability, (iii) robust mitigation of adversarial attacks, (iv) improved privacy and security in model building, (v) being decent, and (vi) model attribution, including the right level of credit assignment to the data sources, model architectures, and transparency in lineage.

* ACM CODS-COMAD 2021 Tutorial

Via

Access Paper or Ask Questions

WaveTransform: Crafting Adversarial Examples via Input Decomposition

Oct 29, 2020

Divyam Anshumaan, Akshay Agarwal, Mayank Vatsa, Richa Singh

Figure 1 for WaveTransform: Crafting Adversarial Examples via Input Decomposition

Figure 2 for WaveTransform: Crafting Adversarial Examples via Input Decomposition

Figure 3 for WaveTransform: Crafting Adversarial Examples via Input Decomposition

Figure 4 for WaveTransform: Crafting Adversarial Examples via Input Decomposition

Abstract:Frequency spectrum has played a significant role in learning unique and discriminating features for object recognition. Both low and high frequency information present in images have been extracted and learnt by a host of representation learning techniques, including deep learning. Inspired by this observation, we introduce a novel class of adversarial attacks, namely `WaveTransform', that creates adversarial noise corresponding to low-frequency and high-frequency subbands, separately (or in combination). The frequency subbands are analyzed using wavelet decomposition; the subbands are corrupted and then used to construct an adversarial example. Experiments are performed using multiple databases and CNN models to establish the effectiveness of the proposed WaveTransform attack and analyze the importance of a particular frequency component. The robustness of the proposed attack is also evaluated through its transferability and resiliency against a recent adversarial defense algorithm. Experiments show that the proposed attack is effective against the defense algorithm and is also transferable across CNNs.

* ECCV Workshop Adversarial Robustness in the Real World 2020, 17 pages, 3 Tables, 6 Figures

Via

Access Paper or Ask Questions

Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

Oct 25, 2020

Saheb Chhabra, Akshay Agarwal, Richa Singh, Mayank Vatsa

Figure 1 for Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

Figure 2 for Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

Figure 3 for Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

Figure 4 for Attack Agnostic Adversarial Defense via Visual Imperceptible Bound

Abstract:The high susceptibility of deep learning algorithms against structured and unstructured perturbations has motivated the development of efficient adversarial defense algorithms. However, the lack of generalizability of existing defense algorithms and the high variability in the performance of the attack algorithms for different databases raises several questions on the effectiveness of the defense algorithms. In this research, we aim to design a defense model that is robust within a certain bound against both seen and unseen adversarial attacks. This bound is related to the visual appearance of an image, and we termed it as \textit{Visual Imperceptible Bound (VIB)}. To compute this bound, we propose a novel method that uses the database characteristics. The VIB is further used to measure the effectiveness of attack algorithms. The performance of the proposed defense model is evaluated on the MNIST, CIFAR-10, and Tiny ImageNet databases on multiple attacks that include C\&W ($l_2$) and DeepFool. The proposed defense model is not only able to increase the robustness against several attacks but also retain or improve the classification accuracy on an original clean test set. The proposed algorithm is attack agnostic, i.e. it does not require any knowledge of the attack algorithm.

* ICPR 2020, 8 pages, 5 figures, 7 tables

Via

Access Paper or Ask Questions

MixNet for Generalized Face Presentation Attack Detection

Oct 25, 2020

Nilay Sanghvi, Sushant Kumar Singh, Akshay Agarwal, Mayank Vatsa, Richa Singh

Figure 1 for MixNet for Generalized Face Presentation Attack Detection

Figure 2 for MixNet for Generalized Face Presentation Attack Detection

Figure 3 for MixNet for Generalized Face Presentation Attack Detection

Figure 4 for MixNet for Generalized Face Presentation Attack Detection

Abstract:The non-intrusive nature and high accuracy of face recognition algorithms have led to their successful deployment across multiple applications ranging from border access to mobile unlocking and digital payments. However, their vulnerability against sophisticated and cost-effective presentation attack mediums raises essential questions regarding its reliability. In the literature, several presentation attack detection algorithms are presented; however, they are still far behind from reality. The major problem with existing work is the generalizability against multiple attacks both in the seen and unseen setting. The algorithms which are useful for one kind of attack (such as print) perform unsatisfactorily for another type of attack (such as silicone masks). In this research, we have proposed a deep learning-based network termed as \textit{MixNet} to detect presentation attacks in cross-database and unseen attack settings. The proposed algorithm utilizes state-of-the-art convolutional neural network architectures and learns the feature mapping for each attack category. Experiments are performed using multiple challenging face presentation attack databases such as SMAD and Spoof In the Wild (SiW-M) databases. Extensive experiments and comparison with existing state of the art algorithms show the effectiveness of the proposed algorithm.

* ICPR 2020, 8 pages, 6 figures, 7 tables

Via

Access Paper or Ask Questions

Generalized Iris Presentation Attack Detection Algorithm under Cross-Database Settings

Oct 25, 2020

Mehak Gupta, Vishal Singh, Akshay Agarwal, Mayank Vatsa, Richa Singh

Figure 1 for Generalized Iris Presentation Attack Detection Algorithm under Cross-Database Settings

Figure 2 for Generalized Iris Presentation Attack Detection Algorithm under Cross-Database Settings

Figure 3 for Generalized Iris Presentation Attack Detection Algorithm under Cross-Database Settings

Figure 4 for Generalized Iris Presentation Attack Detection Algorithm under Cross-Database Settings

Abstract:Presentation attacks are posing major challenges to most of the biometric modalities. Iris recognition, which is considered as one of the most accurate biometric modality for person identification, has also been shown to be vulnerable to advanced presentation attacks such as 3D contact lenses and textured lens. While in the literature, several presentation attack detection (PAD) algorithms are presented; a significant limitation is the generalizability against an unseen database, unseen sensor, and different imaging environment. To address this challenge, we propose a generalized deep learning-based PAD network, MVANet, which utilizes multiple representation layers. It is inspired by the simplicity and success of hybrid algorithm or fusion of multiple detection networks. The computational complexity is an essential factor in training deep neural networks; therefore, to reduce the computational complexity while learning multiple feature representation layers, a fixed base model has been used. The performance of the proposed network is demonstrated on multiple databases such as IIITD-WVU MUIPA and IIITD-CLI databases under cross-database training-testing settings, to assess the generalizability of the proposed algorithm.

* ICPR 2020, 8 pages, 7 figures, 4 tables

Via

Access Paper or Ask Questions

Unravelling Small Sample Size Problems in the Deep Learning World

Aug 08, 2020

Rohit Keshari, Soumyadeep Ghosh, Saheb Chhabra, Mayank Vatsa, Richa Singh

Figure 1 for Unravelling Small Sample Size Problems in the Deep Learning World

Figure 2 for Unravelling Small Sample Size Problems in the Deep Learning World

Figure 3 for Unravelling Small Sample Size Problems in the Deep Learning World

Figure 4 for Unravelling Small Sample Size Problems in the Deep Learning World

Abstract:The growth and success of deep learning approaches can be attributed to two major factors: availability of hardware resources and availability of large number of training samples. For problems with large training databases, deep learning models have achieved superlative performances. However, there are a lot of \textit{small sample size or $S^3$} problems for which it is not feasible to collect large training databases. It has been observed that deep learning models do not generalize well on $S^3$ problems and specialized solutions are required. In this paper, we first present a review of deep learning algorithms for small sample size problems in which the algorithms are segregated according to the space in which they operate, i.e. input space, model space, and feature space. Secondly, we present Dynamic Attention Pooling approach which focuses on extracting global information from the most discriminative sub-part of the feature map. The performance of the proposed dynamic attention pooling is analyzed with state-of-the-art ResNet model on relatively small publicly available datasets such as SVHN, C10, C100, and TinyImageNet.

* 3 figures, 2 tables, accepted in BigMM 2020

Via

Access Paper or Ask Questions

Subclass Contrastive Loss for Injured Face Recognition

Aug 05, 2020

Puspita Majumdar, Saheb Chhabra, Richa Singh, Mayank Vatsa

Figure 1 for Subclass Contrastive Loss for Injured Face Recognition

Figure 2 for Subclass Contrastive Loss for Injured Face Recognition

Figure 3 for Subclass Contrastive Loss for Injured Face Recognition

Figure 4 for Subclass Contrastive Loss for Injured Face Recognition

Abstract:Deaths and injuries are common in road accidents, violence, and natural disaster. In such cases, one of the main tasks of responders is to retrieve the identity of the victims to reunite families and ensure proper identification of deceased/ injured individuals. Apart from this, identification of unidentified dead bodies due to violence and accidents is crucial for the police investigation. In the absence of identification cards, current practices for this task include DNA profiling and dental profiling. Face is one of the most commonly used and widely accepted biometric modalities for recognition. However, face recognition is challenging in the presence of facial injuries such as swelling, bruises, blood clots, laceration, and avulsion which affect the features used in recognition. In this paper, for the first time, we address the problem of injured face recognition and propose a novel Subclass Contrastive Loss (SCL) for this task. A novel database, termed as Injured Face (IF) database, is also created to instigate research in this direction. Experimental analysis shows that the proposed loss function surpasses existing algorithm for injured face recognition.

* Accepted in BTAS 2019

Via

Access Paper or Ask Questions

Multi-Task Driven Explainable Diagnosis of COVID-19 using Chest X-ray Images

Aug 03, 2020

Aakarsh Malhotra, Surbhi Mittal, Puspita Majumdar, Saheb Chhabra, Kartik Thakral, Mayank Vatsa, Richa Singh, Santanu Chaudhury, Ashwin Pudrod, Anjali Agrawal

Figure 1 for Multi-Task Driven Explainable Diagnosis of COVID-19 using Chest X-ray Images

Figure 2 for Multi-Task Driven Explainable Diagnosis of COVID-19 using Chest X-ray Images

Figure 3 for Multi-Task Driven Explainable Diagnosis of COVID-19 using Chest X-ray Images

Figure 4 for Multi-Task Driven Explainable Diagnosis of COVID-19 using Chest X-ray Images

Abstract:With increasing number of COVID-19 cases globally, all the countries are ramping up the testing numbers. While the RT-PCR kits are available in sufficient quantity in several countries, others are facing challenges with limited availability of testing kits and processing centers in remote areas. This has motivated researchers to find alternate methods of testing which are reliable, easily accessible and faster. Chest X-Ray is one of the modalities that is gaining acceptance as a screening modality. Towards this direction, the paper has two primary contributions. Firstly, we present the COVID-19 Multi-Task Network which is an automated end-to-end network for COVID-19 screening. The proposed network not only predicts whether the CXR has COVID-19 features present or not, it also performs semantic segmentation of the regions of interest to make the model explainable. Secondly, with the help of medical professionals, we manually annotate the lung regions of 9000 frontal chest radiographs taken from ChestXray-14, CheXpert and a consolidated COVID-19 dataset. Further, 200 chest radiographs pertaining to COVID-19 patients are also annotated for semantic segmentation. This database will be released to the research community.

Via

Access Paper or Ask Questions