Alert button
Picture for Prashant Pandey

Prashant Pandey

Alert button

Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism

Dec 02, 2019
Prashant Pandey, Prathosh AP, Manu Kohli, Josh Pritchard

Figure 1 for Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism
Figure 2 for Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism
Figure 3 for Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism
Figure 4 for Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism

Diagnostic and intervention methodologies for skill assessment of autism typically requires a clinician repetitively initiating several stimuli and recording the child's response. In this paper, we propose to automate the response measurement through video recording of the scene following the use of Deep Neural models for human action recognition from videos. However, supervised learning of neural networks demand large amounts of annotated data that are hard to come by. This issue is addressed by leveraging the `similarities' between the action categories in publicly available large-scale video action (source) datasets and the dataset of interest. A technique called guided weak supervision is proposed, where every class in the target data is matched to a class in the source data using the principle of posterior likelihood maximization. Subsequently, classifier on the target data is re-trained by augmenting samples from the matched source classes, along with a new loss encouraging inter-class separability. The proposed method is evaluated on two skill assessment autism datasets, SSBD and a real world Autism dataset comprising 37 children of different ages and ethnicity who are diagnosed with autism. Our proposed method is found to improve the performance of the state-of-the-art multi-class human action recognition models in-spite of supervision with scarce data.

* AAAI 2020 
Viaarxiv icon

Guided weak supervision for action recognition with scarce data to assess skills of children with autism

Nov 11, 2019
Prashant Pandey, Prathosh AP, Manu Kohli, Josh Pritchard

Figure 1 for Guided weak supervision for action recognition with scarce data to assess skills of children with autism
Figure 2 for Guided weak supervision for action recognition with scarce data to assess skills of children with autism
Figure 3 for Guided weak supervision for action recognition with scarce data to assess skills of children with autism
Figure 4 for Guided weak supervision for action recognition with scarce data to assess skills of children with autism

Diagnostic and intervention methodologies for skill assessment of autism typically requires a clinician repetitively initiating several stimuli and recording the child's response. In this paper, we propose to automate the response measurement through video recording of the scene following the use of Deep Neural models for human action recognition from videos. However, supervised learning of neural networks demand large amounts of annotated data that are hard to come by. This issue is addressed by leveraging the `similarities' between the action categories in publicly available large-scale video action (source) datasets and the dataset of interest. A technique called guided weak supervision is proposed, where every class in the target data is matched to a class in the source data using the principle of posterior likelihood maximization. Subsequently, classifier on the target data is re-trained by augmenting samples from the matched source classes, along with a new loss encouraging inter-class separability. The proposed method is evaluated on two skill assessment autism datasets, SSBD and a real world Autism dataset comprising 37 children of different ages and ethnicity who are diagnosed with autism. Our proposed method is found to improve the performance of the state-of-the-art multi-class human action recognition models in-spite of supervision with scarce data.

Viaarxiv icon

NEMGAN: Noise Engineered Mode-matching GAN

Nov 08, 2018
Deepak Mishra, Prathosh AP, Aravind J, Prashant Pandey, Santanu Chaudhury

Figure 1 for NEMGAN: Noise Engineered Mode-matching GAN
Figure 2 for NEMGAN: Noise Engineered Mode-matching GAN
Figure 3 for NEMGAN: Noise Engineered Mode-matching GAN
Figure 4 for NEMGAN: Noise Engineered Mode-matching GAN

Conditional generation refers to the process of sampling from an unknown distribution conditioned on semantics of the data. This can be achieved by augmenting the generative model with the desired semantic labels, albeit it is not straightforward in an unsupervised setting where the semantic label of every data sample is unknown. In this paper, we address this issue by proposing a method that can generate samples conditioned on the properties of a latent distribution engineered in accordance with a certain data prior. In particular, a latent space inversion network is trained in tandem with a generative adversarial network such that the modal properties of the latent space distribution are induced in the data generating distribution. We demonstrate that our model despite being fully unsupervised, is effective in learning meaningful representations through its mode matching property. We validate our method on multiple unsupervised tasks such as conditional generation, attribute discovery and inference using three real world image datasets namely MNIST, CIFAR-10 and CelebA and show that the results are comparable to the state-of-the-art methods.

Viaarxiv icon