Marcus Liwicki

ICDAR 2019 Historical Document Reading Challenge on Large Structured Chinese Family Records

Mar 18, 2019
Foteini Simistira Liwicki, Rajkumar Saini, Derek Dobson, Jon Morrey, Marcus Liwicki

We propose a Historical Document Reading Challenge on Large Chinese Structured Family Records, in short ICDAR2019 HDRC CHINESE. The objective of the proposed competition is to recognize and analyze the layout, and finally detect and recognize the textlines and characters of the large historical document collection containing more than 20 000 pages kindly provided by FamilySearch.

Using Deep Object Features for Image Descriptions

Feb 25, 2019
Ashutosh Mishra, Marcus Liwicki

Inspired by recent advances in leveraging multiple modalities in machine translation, we introduce an encoder-decoder pipeline that uses (1) specific objects within an image and their object labels, and (2) a language model for decoding the joint embedding of object features and object labels. Our pipeline merges previously detected objects from the image with their object labels and then learns the sequences of captions describing the particular image. The decoder model learns to extract descriptions for the image from scratch by decoding the joint representation of the object visual features and their object classes, conditioned by the encoder component. The idea of the model is to concentrate only on the specific objects of the image and their labels when generating descriptions, rather than on the visual features of the entire image. The model needs further calibration of its parameters and settings to achieve better accuracy and performance.

* arXiv admin note: text overlap with arXiv:1411.2539, arXiv:1609.06647 by other authors

A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference

Jan 08, 2019
Kumar Shridhar, Felix Laumann, Marcus Liwicki

Artificial Neural Networks are connectionist systems that perform a given task by learning from examples without having prior knowledge about the task. This is done by finding an optimal point estimate for the weights in every node. Generally, networks using point estimates as weights perform well with large datasets, but they fail to express uncertainty in regions with little or no data, leading to overconfident decisions. In this paper, a Bayesian Convolutional Neural Network (BayesCNN) using Variational Inference is proposed, which introduces a probability distribution over the weights. Furthermore, the proposed BayesCNN architecture is applied to tasks like Image Classification, Image Super-Resolution and Generative Adversarial Networks. The results are compared to point-estimate-based architectures on the MNIST, CIFAR-10 and CIFAR-100 datasets for the Image Classification task, on the BSD300 dataset for the Image Super-Resolution task and on the CIFAR-10 dataset again for the Generative Adversarial Network task. BayesCNN is based on Bayes by Backprop, which derives a variational approximation to the true posterior. We, therefore, introduce the idea of applying two convolutional operations, one for the mean and one for the variance. Our proposed method not only achieves performance equivalent to frequentist inference in identical architectures but also incorporates a measurement for uncertainties and regularisation. It further eliminates the use of dropout in the model. Moreover, we predict how certain the model prediction is based on the epistemic and aleatoric uncertainties and empirically show how the uncertainty can decrease, allowing the decisions made by the network to become more deterministic as the training accuracy increases. Finally, we propose ways to prune the Bayesian architecture and to make it more computationally and time efficient.

* arXiv admin note: text overlap with arXiv:1506.02158, arXiv:1703.04977 by other authors
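
The "two convolutional operations, one for the mean and one for the variance" mentioned in the abstract can be illustrated with a minimal sketch. It assumes independent Gaussian weights, a 1D signal, and fixed (not learned) weight variances; the function names `conv1d` and `bayes_conv1d` are hypothetical and are not taken from the paper's code.

```python
import math
import random

def conv1d(signal, kernel):
    """Valid 1D cross-correlation of a list with a kernel."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]

def bayes_conv1d(signal, weight_mean, weight_var, rng):
    """Sample activations of a layer whose weights are independent Gaussians."""
    # First convolution: mean of each output activation.
    act_mean = conv1d(signal, weight_mean)
    # Second convolution: variance of each output activation
    # (squared inputs convolved with the weight variances).
    act_var = conv1d([x * x for x in signal], weight_var)
    # Reparameterised sample: mean + std * standard normal noise.
    return [
        m + math.sqrt(v) * rng.gauss(0.0, 1.0)
        for m, v in zip(act_mean, act_var)
    ]

# Illustrative values only (assumptions, not from the paper).
signal = [1.0, 2.0, 3.0, 4.0]
weight_mean = [0.5, -0.5]
weight_var = [0.01, 0.01]
rng = random.Random(0)

# Repeated forward passes give different outputs; their spread is a
# rough indication of the uncertainty induced by the weights.
samples = [bayes_conv1d(signal, weight_mean, weight_var, rng) for _ in range(5)]
```

With all weight variances set to zero, the sampled output collapses to the ordinary point-estimate convolution, which is one way to see how the Bayesian layer generalises the standard one.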

Leveraging Random Label Memorization for Unsupervised Pre-Training

Nov 05, 2018
Vinaychandran Pondenkandath, Michele Alberti, Sammer Puran, Rolf Ingold, Marcus Liwicki

We present a novel approach to leverage large unlabeled datasets by pre-training state-of-the-art deep neural networks on randomly-labeled datasets. Specifically, we train the neural networks to memorize arbitrary labels for all the samples in a dataset and use these pre-trained networks as a starting point for regular supervised learning. Our assumption is that the "memorization infrastructure" learned by the network during the random-label training proves to be beneficial for conventional supervised learning as well. We test the effectiveness of our pre-training on several video action recognition datasets (HMDB51, UCF101, Kinetics) by comparing the results of the same network with and without the random-label pre-training. Our approach yields an improvement in classification accuracy, ranging from 1.5% on UCF101 to 5% on Kinetics, which calls for further research in this direction.

* 6 pages
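
The first stage described above, giving every unlabeled sample an arbitrary label to memorize, can be sketched as follows. This is only an illustration: the function name `assign_random_labels`, the number of random classes, and the fixed seed are assumptions, and the network training itself is left out.

```python
import random

def assign_random_labels(num_samples, num_classes, seed=0):
    """Give each sample an arbitrary class index, fixed once per dataset."""
    rng = random.Random(seed)
    return [rng.randrange(num_classes) for _ in range(num_samples)]

# Stage 1 (pre-training): the network would be trained to memorize these
# labels, which carry no semantic information about the samples.
random_labels = assign_random_labels(num_samples=10, num_classes=4, seed=0)

# Stage 2 (supervised learning): the same network would then be trained
# on the true labels, starting from the stage 1 weights.
```

The labels have to stay fixed across epochs (hence the seed); if they were redrawn each time, there would be nothing consistent for the network to memorize.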

Offline Signature Verification by Combining Graph Edit Distance and Triplet Networks

Oct 17, 2018
Paul Maergner, Vinaychandran Pondenkandath, Michele Alberti, Marcus Liwicki, Kaspar Riesen, Rolf Ingold, Andreas Fischer

Biometric authentication by means of handwritten signatures is a challenging pattern recognition task, which aims to infer a writer model from only a handful of genuine signatures. In order to make it more difficult for a forger to attack the verification system, a promising strategy is to combine different writer models. In this work, we propose to complement a recent structural approach to offline signature verification based on graph edit distance with a statistical approach based on metric learning with deep neural networks. On the MCYT and GPDS benchmark datasets, we demonstrate that combining the structural and statistical models leads to significant improvements in performance, profiting from their complementary properties.

* Structural, Syntactic, and Statistical Pattern Recognition. S+SSPR 2018. Lecture Notes in Computer Science, vol 11004. Springer, Cham

Subword Semantic Hashing for Intent Classification on Small Datasets

Oct 16, 2018
Kumar Shridhar, Amit Sahu, Ayushman Dash, Pedro Alonso, Gustav Pihlgren, Vinay Pondeknath, Fotini Simistira, Marcus Liwicki

In this paper, we introduce the use of Semantic Hashing as an embedding for the task of Intent Classification and outperform previous state-of-the-art methods on three frequently used benchmarks. Intent Classification on a small dataset is a challenging task for data-hungry state-of-the-art Deep Learning based systems. Semantic Hashing is an attempt to overcome such a challenge and learn robust text classification. Current word embedding based methods are dependent on vocabularies. One of the major drawbacks of such methods is out-of-vocabulary terms, especially when the training datasets are small and the vocabulary in use is wide. This is the case in Intent Classification for chatbots, where typically small datasets are extracted from internet communication. Two problems arise from the use of internet communication. First, such datasets lack many of the vocabulary terms needed to use word embeddings efficiently. Second, users frequently make spelling errors. Typically, the models for intent classification are not trained with spelling errors, and it is difficult to anticipate the ways in which users will make mistakes. Models depending on a word vocabulary will always face such issues. An ideal classifier should handle spelling errors inherently. With Semantic Hashing, we overcome these challenges and achieve state-of-the-art results on three datasets: AskUbuntu, Chatbot, and Web Application. Our benchmarks are available online: https://github.com/kumar-shridhar/Know-Your-Intent
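
The abstract does not spell out the tokenisation, so the sketch below assumes a common subword scheme: each word is wrapped in `#` boundary markers and split into letter trigrams. The function names `subword_trigrams` and `trigram_overlap` are hypothetical; the example only illustrates why subword units tolerate spelling errors better than whole-word vocabularies.

```python
def subword_trigrams(text):
    """Split each word into letter trigrams, with '#' marking word boundaries."""
    tokens = []
    for word in text.lower().split():
        padded = "#" + word + "#"
        tokens.extend(padded[i:i + 3] for i in range(len(padded) - 2))
    return tokens

def trigram_overlap(a, b):
    """Jaccard similarity between the trigram sets of two strings."""
    set_a, set_b = set(subword_trigrams(a)), set(subword_trigrams(b))
    union = set_a | set_b
    if not union:
        return 0.0  # assumption: two empty inputs count as dissimilar
    return len(set_a & set_b) / len(union)

print(subword_trigrams("hello"))
# -> ['#he', 'hel', 'ell', 'llo', 'lo#']

# A misspelling still shares most of its trigrams with the intended word,
# whereas a whole-word vocabulary would treat it as unknown.
print(trigram_overlap("install", "instal"))   # -> 0.625
print(trigram_overlap("install", "remove"))   # -> 0.0
```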

Bayesian Convolutional Neural Networks

Sep 10, 2018
Kumar Shridhar, Felix Laumann, Adrian Llopart Maurin, Marcus Liwicki

We introduce Bayesian Convolutional Neural Networks (BayesCNNs), a variant of Convolutional Neural Networks (CNNs) which is built upon Bayes by Backprop. We demonstrate how this novel, reliable variational inference method can serve as a fundamental construct for various network architectures. On multiple datasets in supervised learning settings (MNIST, CIFAR-10, CIFAR-100, and STL-10), our proposed variational inference method achieves performance equivalent to frequentist inference in identical architectures, while a measurement for uncertainties and a regularisation are incorporated naturally. In the past, Bayes by Backprop has been successfully implemented in feedforward and recurrent neural networks, but not in convolutional ones. This work extends Bayesian neural networks so that they now encompass all three aforementioned types of network architectures.

* arXiv admin note: text overlap with arXiv:1704.02798 by other authors

Are You Tampering With My Data?

Aug 21, 2018
Michele Alberti, Vinaychandran Pondenkandath, Marcel Würsch, Manuel Bouillon, Mathias Seuret, Rolf Ingold, Marcus Liwicki

We propose a novel approach towards adversarial attacks on neural networks (NN), focusing on tampering with the data used for training instead of generating attacks on trained models. Our network-agnostic method creates a backdoor during training which can be exploited at test time to force a neural network to exhibit abnormal behaviour. We demonstrate on two widely used datasets (CIFAR-10 and SVHN) that a universal modification of just one pixel per image for all the images of a class in the training set is enough to corrupt the training procedure of several state-of-the-art deep neural networks, causing the networks to misclassify any images to which the modification is applied. Our aim is to bring to the attention of the machine learning community the possibility that even learning-based methods that are personally trained on public datasets can be subject to attacks by a skillful adversary.

* European Conference on Computer Vision (ECCV 2018), Workshop on Objectionable Content and Misinformation
* 18 pages
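
The "universal modification of just one pixel per image for all the images of a class" can be pictured with a small sketch. It is a simplification under stated assumptions: images are 2D grayscale nested lists rather than the colour images of CIFAR-10 and SVHN, and the pixel position and value are arbitrary choices. The names `stamp_pixel` and `poison_class` are hypothetical.

```python
def stamp_pixel(image, row, col, value):
    """Return a copy of a 2D grayscale image with one pixel overwritten."""
    tampered = [list(r) for r in image]
    tampered[row][col] = value
    return tampered

def poison_class(images, labels, target_class, row=0, col=0, value=255):
    """Apply the same one-pixel change to every image of one class."""
    return [
        stamp_pixel(img, row, col, value) if lab == target_class else img
        for img, lab in zip(images, labels)
    ]

# Two tiny 2x2 "images": one of class 0, one of class 1.
images = [[[0, 0], [0, 0]], [[1, 1], [1, 1]]]
labels = [0, 1]
poisoned = poison_class(images, labels, target_class=1)
```

Because the modified pixel correlates perfectly with the target class in the training set, a network can learn it as a shortcut; checking a dataset for pixels with this kind of class-specific constancy is one simple sanity check a defender might run.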

Recognizing Challenging Handwritten Annotations with Fully Convolutional Networks

Jun 22, 2018
Andreas Kölsch, Ashutosh Mishra, Saurabh Varshneya, Muhammad Zeshan Afzal, Marcus Liwicki

This paper introduces a very challenging dataset of historic German documents and evaluates Fully Convolutional Neural Network (FCNN) based methods to locate handwritten annotations of any kind in these documents. The handwritten annotations can appear in the form of underlines and text made with various writing instruments; the use of pencils, for example, makes the data more challenging. We train and evaluate various end-to-end semantic segmentation approaches and report the results. The task is to classify the pixels of documents into two classes: background and handwritten annotation. The best model achieves a mean Intersection over Union (IoU) score of 95.6% on the test documents of the presented dataset. We also present a comparison of different strategies used for data augmentation and training on our presented dataset. For evaluation, we use the Layout Analysis Evaluator for the ICDAR 2017 Competition on Layout Analysis for Challenging Medieval Manuscripts.
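
For readers unfamiliar with the metric, mean IoU for the two-class pixel labelling task can be computed roughly as below. This is a generic sketch, not the Layout Analysis Evaluator's implementation, whose exact definition may differ; the flat-list mask format, the function names, and the convention of scoring an absent class as 1.0 are assumptions.

```python
def class_iou(pred, truth, cls):
    """Intersection over Union for one class over flat pixel label lists."""
    inter = sum(1 for p, t in zip(pred, truth) if p == cls and t == cls)
    union = sum(1 for p, t in zip(pred, truth) if p == cls or t == cls)
    # Assumption: a class absent from both masks counts as a perfect match.
    return inter / union if union else 1.0

def mean_iou(pred, truth, classes=(0, 1)):
    """Average the per-class IoU over background (0) and annotation (1)."""
    return sum(class_iou(pred, truth, c) for c in classes) / len(classes)

pred = [0, 0, 1, 1]   # predicted pixel labels
truth = [0, 1, 1, 1]  # ground-truth pixel labels
score = mean_iou(pred, truth)
# Background IoU is 1/2, annotation IoU is 2/3, so the mean is 7/12.
```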

Bidirectional Learning for Robust Neural Networks

May 21, 2018
Sidney Pontes-Filho, Marcus Liwicki

A multilayer perceptron can behave as a generative classifier by applying bidirectional learning (BL). It consists of training an undirected neural network to map input to output and vice versa; therefore it can produce a classifier in one direction, and a generator in the opposite direction for the same data. In this paper, two novel learning techniques are introduced which use BL for improving robustness to white noise static and adversarial examples. The first method is bidirectional propagation of errors, in which the error propagation occurs in backward and forward directions. Motivated by the fact that its generative model receives as input a constant vector per class, we introduce as a second method the hybrid adversarial networks (HAN). Its generative model receives a random vector as input and its training is based on generative adversarial networks (GAN). To assess the performance of BL, we perform experiments using several architectures with fully connected and convolutional layers, with and without bias. Experimental results show that both methods improve robustness to white noise static and adversarial examples, but behave differently depending on the architecture and task, so that one or the other is more beneficial in a given setting. Nevertheless, HAN using a convolutional architecture with batch normalization presents outstanding robustness, reaching state-of-the-art accuracy on adversarial examples of hand-written digits.

* 10 pages, 4 figures, 5 tables
