Devendra Singh Sachan

Investigating the Working of Text Classifiers

Aug 05, 2018
Devendra Singh Sachan, Manzil Zaheer, Ruslan Salakhutdinov

Text classification is one of the most widely studied tasks in natural language processing. Motivated by the principle of compositionality, large multilayer neural network models have been employed for this task in an attempt to effectively utilize the constituent expressions. Almost all reported work trains large networks using discriminative approaches, which come with the caveat of no proper capacity control, as they tend to latch on to any available signal, including signals that may not generalize. Using various recent state-of-the-art approaches for text classification, we explore whether these models actually learn to compose the meaning of the sentences or still just focus on some keywords or lexicons for classifying the document. To test our hypothesis, we carefully construct datasets where the training and test splits have no direct overlap of such lexicons, but the overall language structure is similar. We study various text classifiers and observe that there is a big performance drop on these datasets. Finally, we show that even simple models with our proposed regularization techniques, which disincentivize focusing on key lexicons, can substantially improve classification accuracy.

* Proceedings of COLING 2018, the 27th International Conference on Computational Linguistics: Technical Papers (COLING 2018), NIPS 2017 Workshop on Deep Learning: Bridging Theory and Practice
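
The abstract above describes training and test splits that share no key lexicons. A minimal sketch of how such a property could be checked is shown below. It is not the authors' dataset construction procedure: the function name `lexicon_overlap`, the whitespace tokenization, and the toy sentences and lexicon are all assumptions made for illustration.

```python
def lexicon_overlap(train_docs, test_docs, lexicon):
    """Return the lexicon terms that occur in both splits.

    Hypothetical helper: tokenization is a plain lowercase whitespace split,
    which is an assumption, not the procedure used in the paper.
    """
    train_vocab = {tok for doc in train_docs for tok in doc.lower().split()}
    test_vocab = {tok for doc in test_docs for tok in doc.lower().split()}
    return lexicon & train_vocab & test_vocab


# Toy data (made up): both splits cover sports and politics,
# but with different topic keywords.
train = ["the pitcher threw a fastball", "the senate passed the bill"]
test = ["the striker scored a goal", "the parliament debated the law"]
lexicon = {"pitcher", "fastball", "senate", "bill",
           "striker", "goal", "parliament", "law"}

shared = lexicon_overlap(train, test, lexicon)
print(shared)  # an empty set means no key lexicon appears in both splits
```

Function words such as "the" still appear in both splits, which matches the idea that the overall language structure stays similar while the key lexicons differ.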

When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?

Apr 18, 2018
Ye Qi, Devendra Singh Sachan, Matthieu Felix, Sarguna Janani Padmanabhan, Graham Neubig

The performance of Neural Machine Translation (NMT) systems often suffers in low-resource scenarios where sufficiently large-scale parallel corpora cannot be obtained. Pre-trained word embeddings have proven to be invaluable for improving performance in natural language analysis tasks, which often suffer from paucity of data. However, their utility for NMT has not been extensively explored. In this work, we perform five sets of experiments that analyze when we can expect pre-trained word embeddings to help in NMT tasks. We show that such embeddings can be surprisingly effective in some cases -- providing gains of up to 20 BLEU points in the most favorable setting.

* NAACL 2018

XNMT: The eXtensible Neural Machine Translation Toolkit

Mar 01, 2018
Graham Neubig, Matthias Sperber, Xinyi Wang, Matthieu Felix, Austin Matthews, Sarguna Padmanabhan, Ye Qi, Devendra Singh Sachan, Philip Arthur, Pierre Godard, John Hewitt, Rachid Riad, Liming Wang

This paper describes XNMT, the eXtensible Neural Machine Translation toolkit. XNMT distinguishes itself from other open-source NMT toolkits by its focus on modular code design, with the purpose of enabling fast iteration in research and replicable, reliable results. In this paper we describe the design of XNMT and its experiment configuration system, and demonstrate its utility on the tasks of machine translation, speech recognition, and multi-tasked machine translation/parsing. XNMT is available open-source at https://github.com/neulab/xnmt

* To be presented at AMTA 2018 Open Source Software Showcase

Class Vectors: Embedding representation of Document Classes

Aug 02, 2015
Devendra Singh Sachan, Shailesh Kumar

Distributed representations of words and paragraphs as semantic embeddings in high-dimensional spaces are used across a number of Natural Language Understanding tasks such as retrieval, translation, and classification. In this work, we propose "Class Vectors", a framework for learning a vector per class in the same embedding space as the word and paragraph embeddings. Similarities between these class vectors and word vectors are used as features to assign a document to a class. In experiments on several sentiment analysis tasks such as Yelp reviews and Amazon electronic product reviews, class vectors have shown better or comparable results in classification while learning very meaningful class embeddings.
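
The idea of using similarities between class vectors and word vectors as features can be sketched as follows. This is an illustration under stated assumptions rather than the paper's method: the embeddings here are hand-written toy values instead of learned vectors, and averaging the cosine similarity over a document's words and picking the highest-scoring class are choices made only for this example. The names `cosine` and `class_similarity_features` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def class_similarity_features(tokens, word_vectors, class_vectors):
    """One feature per class: mean cosine similarity to the document's known words.

    Assumption: averaging is used here for simplicity; the paper may combine
    similarities differently.
    """
    known = [word_vectors[t] for t in tokens if t in word_vectors]
    features = {}
    for label, class_vec in class_vectors.items():
        sims = [cosine(w, class_vec) for w in known]
        features[label] = sum(sims) / len(sims) if sims else 0.0
    return features


# Toy 2-d embeddings (made up); in the paper these live in a shared learned space.
word_vectors = {
    "great": [0.9, 0.1], "tasty": [0.8, 0.2],
    "awful": [0.1, 0.9], "slow": [0.2, 0.8],
}
class_vectors = {"positive": [1.0, 0.0], "negative": [0.0, 1.0]}

features = class_similarity_features("great tasty food".split(),
                                     word_vectors, class_vectors)
predicted = max(features, key=features.get)
print(predicted)  # positive
```

In practice the feature dictionary would more likely be fed to a downstream classifier than used with a direct argmax; the argmax is kept here only to make the toy example end to end.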
